Add Zamba #30950

ArthurZucker merged 115 commits into huggingface:main from Zyphra:main
pglorio
amyeroberts amyeroberts added New model
pglorio
ArthurZucker
younesbelkada
younesbelkada commented on 2024-05-23
amazingvince
pglorio
pglorio
amazingvince
Quentin-Anthony
ArthurZucker
amazingvince
younesbelkada
younesbelkada commented on 2024-06-10
pglorio
younesbelkada
younesbelkada commented on 2024-06-19
pglorio pglorio force pushed 1 year ago
pglorio pglorio force pushed 1 year ago
pglorio
ArthurZucker
pglorio pglorio force pushed 1 year ago
pglorio
ArthurZucker
pglorio pglorio force pushed 1 year ago
pglorio Update index.md
7eff1ccd
pglorio Rebase
14961a27
pglorio Rebase
b67ff241
pglorio Updates from make fixup
0aa10032
pglorio Update zamba.md
5e886530
pglorio Batched inference
123d9597
pglorio Update
f35bdf93
Fix tests
1ec90d1f
pglorio Fix tests
4d3f8c07
pglorio Fix tests
e51113da
pglorio Fix tests
cf6ee16c
pglorio pglorio force pushed to cf6ee16c 1 year ago
ArthurZucker
ArthurZucker commented on 2024-09-03
pglorio Update docs/source/en/model_doc/zamba.md
f80b813a
pglorio Update docs/source/en/model_doc/zamba.md
c010a68f
pglorio Update configuration_zamba.py
9c3abc87
pglorio Update src/transformers/models/zamba/modeling_zamba.py
5d3d6154
pglorio Update src/transformers/models/zamba/modeling_zamba.py
d245749b
pglorio Update src/transformers/models/zamba/modeling_zamba.py
663343de
pglorio Update src/transformers/models/zamba/modeling_zamba.py
b3540eaa
pglorio Update modeling_zamba.py
554c14c2
pglorio Update modeling_zamba.py
c9c97fd6
pglorio Update modeling_zamba.py
ec9edd7c
pglorio Update configuration_zamba.py
939d6a91
pglorio Update modeling_zamba.py
c26addd3
pglorio Update modeling_zamba.py
e3c93f02
pglorio Merge branch 'main' of https://github.com/Zyphra/transformers_zamba
58d8c2dd
pglorio Update ZambaForCausalLM
396ebff5
pglorio Update ZambaForCausalLM
df8dfd30
pglorio Describe diffs with original mamba layer
4ab88a29
pglorio Moved mamba init into `_init_weights`
1a521de3
pglorio Moved mamba weight init into _init_weights
767a5911
pglorio Update index.md
d5b2beb2
pglorio Rebase
029813b9
pglorio Rebase
bec7dce7
pglorio Updates from make fixup
db153483
pglorio Update zamba.md
6c7f812e
pglorio Batched inference
c3766ba7
pglorio Update
dff24b86
Fix tests
0e9f3c94
pglorio Fix tests
245d9d9c
pglorio Fix tests
8aedd305
pglorio Fix tests
f8ed17a1
pglorio Update docs/source/en/model_doc/zamba.md
a5d5873c
pglorio Update docs/source/en/model_doc/zamba.md
17cef258
pglorio Update configuration_zamba.py
f773f120
pglorio Update src/transformers/models/zamba/modeling_zamba.py
c5852aa0
pglorio Update src/transformers/models/zamba/modeling_zamba.py
da64b363
pglorio Update src/transformers/models/zamba/modeling_zamba.py
7679578d
pglorio Update src/transformers/models/zamba/modeling_zamba.py
85fe7cb1
pglorio Update modeling_zamba.py
6c949c6e
pglorio Update modeling_zamba.py
f78b627a
pglorio Update modeling_zamba.py
3b3605a9
pglorio Update configuration_zamba.py
f6fc1e8f
pglorio Update modeling_zamba.py
d5b8d6e5
pglorio Update modeling_zamba.py
c2428a47
pglorio Merge branch 'main' of https://github.com/Zyphra/transformers_zamba
bbc9a8e3
pglorio Update ZambaForCausalLM
b13fdde1
pglorio Moved mamba init into `_init_weights`
037b9380
pglorio Update ZambaForCausalLM
9a1ef16a
pglorio Describe diffs with original mamba layer
d9d436c4
pglorio Merge branch 'main' of https://github.com/Zyphra/transformers_zamba
0bbb6c9a
make fixup fixes
91bc0767
pglorio quality test fixes
8f0100ff
pglorio Fix Zamba model path
7478e251
pglorio circleci fixes
a7c9d17a
pglorio circleci fixes
c2d097f6
pglorio circleci fixes
37881969
pglorio circleci fixes
1c6cca86
pglorio circleci fixes
911a78a2
pglorio circleci fixes
c6f2b3f9
pglorio circleci fixes
df931325
pglorio circleci fixes
211a5b50
pglorio circleci fixes
e0cb9fe3
pglorio Update
1df30bbd
pglorio circleci fixes
3d2800b8
pglorio
Quentin-Anthony Merge branch 'huggingface:main' into main
1e6f38be
Quentin-Anthony fix zamba test from merge
d93377d0
Quentin-Anthony
Quentin-Anthony fix ValueError for disabling mamba kernels
d01d80d9
ArthurZucker
ArthurZucker commented on 2024-09-23
Quentin-Anthony add HF copyright
b9e86b05
Quentin-Anthony shared_transf --> shared_transformer
4b0fb525
pglorio Update src/transformers/models/zamba/modeling_zamba.py
66b72c8e
pglorio Update src/transformers/models/zamba/modeling_zamba.py
d527a144
pglorio Fixes
97c646c2
pglorio Move attention head dim to config
1e4ffe6d
pglorio Fix circle/ci tests
2c53db25
Quentin-Anthony Merge branch 'huggingface:main' into main
9a1ad324
pglorio Update modeling_zamba.py
a7717f28
Quentin-Anthony apply GenerationMixin inheritance change from upstream
0fae3983
Quentin-Anthony apply import ordering
0304440f
Quentin-Anthony
Quentin-Anthony Merge branch 'huggingface:main' into main
3d9ec8e1
Quentin-Anthony Merge branch 'main' into main
cb1d1d9a
Quentin-Anthony
Quentin-Anthony Merge branch 'huggingface:main' into main
efcf16a0
ArthurZucker
Quentin-Anthony
ArthurZucker
ArthurZucker approved these changes on 2024-09-26
ArthurZucker
Quentin-Anthony update needed transformers version for zamba
339d4ccc
Quentin-Anthony add contribution author
a46a26ba
Quentin-Anthony add @slow to avoid CI
d0c1bc10
Quentin-Anthony Merge branch 'huggingface:main' into main
4fcd130e
pglorio Update src/transformers/models/zamba/modeling_zamba.py
8d299644
pglorio Define attention_hidden_size
0381c337
pglorio
Quentin-Anthony Merge branch 'huggingface:main' into main
75554d8b
pglorio Added doc for attention_head_size
a109b3fc
Quentin-Anthony trigger CI
9c10afe0
pglorio Fix doc of attention_hidden_size
18804555
Quentin-Anthony
Quentin-Anthony Merge branch 'huggingface:main' into main
daef5b0b
HuggingFaceDocBuilderDev
ArthurZucker ArthurZucker added run-slow
pglorio [run-slow] zamba
347f7614
Quentin-Anthony Merge branch 'huggingface:main' into main
634837f3
Quentin-Anthony Merge branch 'huggingface:main' into main
4e8db07e
HuggingFaceDocBuilderDev
hg0428
pglorio
ArthurZucker
ArthurZucker approved these changes on 2024-10-01
hg0428
ArthurZucker
pglorio Fixed shared layer logic, swapped up<->gate in mlp
15047741
pglorio fix shared layer logic, swap up<->gate in mlp
06e3a7a5
pglorio shared_transformer -> shared_transf
267530d9
pglorio reformat HybridLayer __init__
0a90fc70
Quentin-Anthony Merge branch 'huggingface:main' into main
fabaaecb
pglorio fix docstrings in zamba config
75f0d893
pglorio added definition of _get_input_ids_and_config
b9545eb4
pglorio fixed formatting of _get_input_ids_and_config
cdbd6906
pglorio
hg0428
Quentin-Anthony Merge branch 'huggingface:main' into main
6fabb6a7
Quentin-Anthony Merge branch 'huggingface:main' into main
b9f6cce8
Quentin-Anthony
ArthurZucker ArthurZucker merged f319ba16 into main 1 year ago
Quentin-Anthony
ArthurZucker
fakerybakery
ArthurZucker
hg0428

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone