transformers
Fix RMSNormGated in Zamba2
#35943
Merged

Fix RMSNormGated in Zamba2 #35943

ArthurZucker merged 99 commits into huggingface:main from Zyphra:zamba2
pglorio
pglorio First commit
acd25b74
pglorio Finish model implementation
70639b84
pglorio First commit
d111b988
pglorio Finish model implementation
8f36dba7
pglorio Merge branch 'zamba2' of https://github.com/Zyphra/transformers_zamba…
f0c547cd
pglorio Register zamba2
700fbf03
pglorio generated modeling and configuration
70a60219
pglorio Merge pull request #2 from Zyphra/main
88c4b26e
pglorio generated modeling and configuration
685906a0
pglorio added hybrid cache
4da8d5ff
pglorio fix attention_mask in mamba
6b5a9be2
pglorio dropped unused loras
248350d6
pglorio fix flash2
d1d2c668
pglorio Merge pull request #3 from Zyphra/main
eb6063e8
config docstrings
5f5d01ea
pglorio fix config and fwd pass
c1b7647f
pglorio make fixup fixes
979b99bf
pglorio text_modeling_zamba2
9d9b2eb7
pglorio Merge pull request #4 from Zyphra/main
3a457f58
pglorio small fixes
549d4cb4
pglorio make fixup fixes
987bba9f
pglorio Merge pull request #5 from Zyphra/main
ffc2a58f
pglorio Fix modular model converter
9adf85e0
pglorio added inheritances in modular, renamed zamba cache
904da4e9
pglorio Merge pull request #6 from Zyphra/main
47259837
pglorio modular rebase
0be27d74
pglorio Rebase
cc0c5493
pglorio new modular conversion
ac77a097
pglorio fix generated modeling file
e59980e3
pglorio fixed import for Zamba2RMSNormGated
73a647aa
pglorio modular file cleanup
c2b72a5b
pglorio rebase
0eb39a5d
pglorio make fixup and model tests
10a0b1e1
pglorio dropped inheritance for Zamba2PreTrainedModel
0270667f
pglorio make fixup and unit tests
189c8c54
pglorio Add inheritance of rope from GemmaRotaryEmbedding
fa5f79e8
pglorio moved rope to model init
8079ae03
pglorio drop del self.self_attn and del self.feed_forward
d6206ebd
pglorio Rebase onto upstream
f8326993
pglorio fix tests
cf613b71
pglorio renamed lora -> adapter
337faed6
pglorio rewrote adapter implementation
f1b31a13
pglorio rebase
8925c159
pglorio fixed tests
11fdd47a
pglorio Merge branch 'main' into zamba2
02dd0427
pglorio Fix torch_forward in mamba2 layer
5d0a5d46
pglorio Fix torch_forward in mamba2 layer
ef055c90
pglorio Fix torch_forward in mamba2 layer
b993a789
pglorio Dropped adapter in-place sum
bf93251a
pglorio removed rope from attention init
99708af8
pglorio updated rope
d9b4a500
pglorio created get_layers method
095d853b
pglorio rebase
10ebad5d
pglorio make fixup fix
99e343e6
pglorio make fixup fixes
4e409757
pglorio make fixup fixes
61bb32fa
pglorio fix merge conflicts
bb9b24ba
pglorio update to new attention standard
cb90bb4e
pglorio fixes for merge
8ed701e9
pglorio update to new attention standard
1dbc8c73
pglorio make fixup fixes
f24e4525
pglorio rebase
676f8628
pglorio minor fixes
2b29338b
pglorio cache_position
b212cb28
pglorio removed cache_position postion_ids use_cache
1e3b51e5
pglorio remove config from modular
5ace701e
pglorio removed config from modular (2)
535b6319
pglorio rebase
5a16aa98
pglorio import apply_rotary_pos_emb from llama
1c92266d
pglorio fixed rope_kwargs
99bde938
pglorio Instantiate cache in Zamba2Model
baf2ed3f
pglorio fix cache
9afb57ec
pglorio fix @slow decorator
d1687f91
pglorio rebase
4299889e
pglorio rebase
a0545bf8
pglorio small fix in modular file
903f6dc6
pglorio Update docs/source/en/model_doc/zamba2.md
14396d74
pglorio several minor fixes
02f58079
pglorio inherit mamba2decoder fwd and drop position_ids in mamba
bfb02675
pglorio removed docstrings from modular
b2229430
pglorio rebase
b114ad85
pglorio reinstate zamba2 attention decoder fwd
929ee67b
pglorio use regex for tied keys
9007a522
pglorio Revert "use regex for tied keys"
f701dbd4
pglorio use regex for tied keys
87b938b4
pglorio add cpu to slow forward tests
5e092909
pglorio dropped config.use_shared_mlp_adapter
8ed23534
pglorio Update docs/source/en/model_doc/zamba2.md
a9bbd9c1
pglorio rebase
1e827574
pglorio re-convert from modular
37bff341
pglorio resolve merge conflicts
8e0084ce
pglorio extended Zamba2RMSNormGated to n_groups>1
cd304b51
pglorio removed einops import
8f2eb7b9
pglorio set _supports_sdpa = True
be7d81ac
pglorio pglorio changed the title Zamba2 Fix RMSNormGated in Zamba2 1 year ago
vasqu
vasqu commented on 2025-01-29
Rocketknight1
pglorio rebase
de9a4427
pglorio add use_mem_eff_path flag for fused mamba2 fwd
84fbead9
pglorio rebase
6a6ab330
pglorio added docstring for use_mem_eff_ath flag
355bb4c7
ArthurZucker
ArthurZucker commented on 2025-02-04
vasqu
pglorio rebase
5af59547
ArthurZucker
vasqu
ArthurZucker
ArthurZucker approved these changes on 2025-02-04
ArthurZucker ArthurZucker merged a93b8058 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone