Add Zamba2 #34517

ArthurZucker merged 90 commits into huggingface:main from Zyphra:zamba2
pglorio
pglorio First commit
acd25b74
pglorio Finish model implementation
70639b84
pglorio First commit
d111b988
pglorio Finish model implementation
8f36dba7
pglorio Merge branch 'zamba2' of https://github.com/Zyphra/transformers_zamba…
f0c547cd
pglorio Register zamba2
700fbf03
pglorio marked this pull request as draft 1 year ago
pglorio generated modeling and configuration
70a60219
pglorio Merge pull request #2 from Zyphra/main
88c4b26e
pglorio generated modeling and configuration
685906a0
pglorio added hybrid cache
4da8d5ff
pglorio fix attention_mask in mamba
6b5a9be2
pglorio dropped unused loras
248350d6
pglorio fix flash2
d1d2c668
pglorio Merge pull request #3 from Zyphra/main
eb6063e8
config docstrings
5f5d01ea
pglorio fix config and fwd pass
c1b7647f
pglorio make fixup fixes
979b99bf
pglorio text_modeling_zamba2
9d9b2eb7
pglorio Merge pull request #4 from Zyphra/main
3a457f58
pglorio small fixes
549d4cb4
pglorio make fixup fixes
987bba9f
pglorio Merge pull request #5 from Zyphra/main
ffc2a58f
pglorio Fix modular model converter
9adf85e0
pglorio
ArthurZucker
ArthurZucker commented on 2024-11-14
pglorio added inheritances in modular, renamed zamba cache
904da4e9
pglorio force pushed to 904da4e9 1 year ago
pglorio Merge pull request #6 from Zyphra/main
47259837
pglorio modular rebase
0be27d74
pglorio Rebase
cc0c5493
pglorio new modular conversion
ac77a097
pglorio fix generated modeling file
e59980e3
pglorio fixed import for Zamba2RMSNormGated
73a647aa
pglorio modular file cleanup
c2b72a5b
pglorio rebase
0eb39a5d
pglorio make fixup and model tests
10a0b1e1
pglorio dropped inheritance for Zamba2PreTrainedModel
0270667f
pglorio make fixup and unit tests
189c8c54
pglorio
ArthurZucker
Cyrilvallez requested a review from Cyrilvallez 1 year ago
Cyrilvallez
Cyrilvallez
Cyrilvallez commented on 2024-12-03
pglorio Add inheritance of rope from GemmaRotaryEmbedding
fa5f79e8
pglorio moved rope to model init
8079ae03
pglorio drop del self.self_attn and del self.feed_forward
d6206ebd
pglorio Rebase onto upstream
f8326993
pglorio fix tests
cf613b71
pglorio
pglorio renamed lora -> adapter
337faed6
pglorio rewrote adapter implementation
f1b31a13
pglorio rebase
8925c159
pglorio fixed tests
11fdd47a
pglorio
Cyrilvallez
Cyrilvallez commented on 2024-12-18
Cyrilvallez
huggingface deleted a comment from github-actions on 2024-12-18
pglorio Merge branch 'main' into zamba2
02dd0427
pglorio Fix torch_forward in mamba2 layer
5d0a5d46
pglorio Fix torch_forward in mamba2 layer
ef055c90
pglorio Fix torch_forward in mamba2 layer
b993a789
pglorio Dropped adapter in-place sum
bf93251a
pglorio removed rope from attention init
99708af8
pglorio updated rope
d9b4a500
pglorio created get_layers method
095d853b
pglorio
pglorio rebase
10ebad5d
pglorio make fixup fix
99e343e6
pglorio make fixup fixes
4e409757
pglorio make fixup fixes
61bb32fa
pglorio fix merge conflicts
bb9b24ba
pglorio update to new attention standard
cb90bb4e
pglorio fixes for merge
8ed701e9
pglorio update to new attention standard
1dbc8c73
pglorio make fixup fixes
f24e4525
pglorio
Cyrilvallez
Cyrilvallez commented on 2025-01-15
pglorio rebase
676f8628
pglorio minor fixes
2b29338b
pglorio cache_position
b212cb28
pglorio removed cache_position position_ids use_cache
1e3b51e5
pglorio
pglorio remove config from modular
5ace701e
pglorio removed config from modular (2)
535b6319
pglorio rebase
5a16aa98
pglorio import apply_rotary_pos_emb from llama
1c92266d
pglorio fixed rope_kwargs
99bde938
pglorio Instantiate cache in Zamba2Model
baf2ed3f
pglorio fix cache
9afb57ec
pglorio fix @slow decorator
d1687f91
pglorio
pglorio rebase
4299889e
Cyrilvallez
Cyrilvallez approved these changes on 2025-01-20
pglorio
pglorio rebase
a0545bf8
pglorio small fix in modular file
903f6dc6
ArthurZucker
ArthurZucker commented on 2025-01-21
pglorio Update docs/source/en/model_doc/zamba2.md
14396d74
pglorio several minor fixes
02f58079
pglorio inherit mamba2decoder fwd and drop position_ids in mamba
bfb02675
pglorio removed docstrings from modular
b2229430
pglorio rebase
b114ad85
pglorio reinstate zamba2 attention decoder fwd
929ee67b
pglorio use regex for tied keys
9007a522
pglorio Revert "use regex for tied keys"
f701dbd4
pglorio use regex for tied keys
87b938b4
pglorio add cpu to slow forward tests
5e092909
pglorio dropped config.use_shared_mlp_adapter
8ed23534
pglorio
ArthurZucker
ArthurZucker approved these changes on 2025-01-24
pglorio Update docs/source/en/model_doc/zamba2.md
a9bbd9c1
pglorio
pglorio rebase
1e827574
pglorio re-convert from modular
37bff341
ArthurZucker marked this pull request as ready for review 1 year ago
ArthurZucker merged 33cb1f7b into main 1 year ago
