[`Add Mamba`] Adds support for the `Mamba` models #28094
initial-commit
81c642f0
Merge branch 'main' of github.com:huggingface/transformers into add-m…
c50602bb
start cleaning
00d3a6c1
small nits
921bb24a
small nits
b3f216d2
current updates
7235b57f
add kernels
7a407a76
small refactoring little step
9f2a9829
add comments
04c991ab
styling
aa7e8d2b
nit
26748c4a
nits
75e376a3
Style
1c104b51
Merge
0e90daed
Small changes
a8044665
Push dummy mambda simple slow
6b87ad2c
nit
a7ec8d63
Use original names
50464518
Use original names and remove norm
b5831e3d
Updates for inference params
e9a80ad8
Style nd updates
ee4a7ef0
nits
d8c195fb
Match logits
e64fedc2
Add a test
aee558f3
Add expected generated text
eae5f452
nits doc, imports and styling
1f8e8d0a
style
3cc06e5b
oups
5a5324c2
Merge branch 'main' of github.com:huggingface/transformers into add-m…
325b66b5
dont install kernels, invite users to install the required kernels
81303f4d
let use use the original packages
1a103101
styling
89fb490b
nits
6cfe216c
fix some copieds
1ecbd223
update doc
b937122e
fix-copies
9752dd03
styling done
a7881a3c
nits
f445b0da
fix import check
64ec8dd6
run but wrong cuda ress
e6e3ba8c
mamba CUDA works :)
ed4eb4c8
fix the fast path
4c8fc48c
config naming nits
69e103fa
conversion script is not required at this stage
ba21ff24
finish fixing the fast path: generation make sense now!
fe537285
nit
9411169b
Let's start working on the CIs
c2c77096
style
1e73ca91
git push Merge branch 'main' of github.com:huggingface/transformers i…
834f46ff
Merge branch 'main' of github.com:huggingface/transformers into add-m…
a1a94f32
better style
22132229
more nits
2a020066
test nit
8b0412f3
quick fix for now
fbd6a2c0
nits
823f11a7
nit
88896a9a
nit
7f72ee81
Merge branch 'main' of github.com:huggingface/transformers into add-m…
05552474
nit
0072a6c2
nits
7f6c56f4
update test rest
f67c3533
fixup
2ab5a865
update test
8920be32
nit
87d0664f
some fixes
8b00d768
nits
ca9835cf
update test values
796ef3ef
fix styling
170664ac
nit
92493a05
support peft
854ebad5
Merge branch 'main' of github.com:huggingface/transformers into add-m…
3bbd1b14
integrations tests require torchg
aa0e6bb3
also add slow markers
3c1537ef
styling
d06421a6
chose forward wisely
5fb80623
nits
edb4e91a
update tests
eb1fb640
fix gradient checkpointing
de4fe46e
fixup
54ffaa3e
nit
977d34f0
fix doc
0928453b
check copies
2c90536a
fix the docstring
4ba9c792
fix some more tests
3651dbad
style
426e6f39
ArthurZucker
marked this pull request as ready for review 2 years ago
fix beam search
951b1aab
add init schene
4101369c
update
65db96bd
nit
0f3dfc71
fix
f8bd0aa8
fixup the doc
b2bd0c78
fix the doc
cf585294
fixup
e9c34472
tentative update but slow is no longer good
1282a75f
nit
fa561b26
should we always use float32?
91b81061
nits
e8142caf
revert wrong changes
623b6361
res in float32
566c799c
cleanup
5d637d9f
skip fmt for now
648a2922
update generation values
e306e891
update test values running original model
057d7a3d
fixup
72f8936c
update tests + rename inference_params to cache_params + make sure tr…
f415081d
small nits
6bb659a8
more nits
178fe76b
fix final CIs
3a46724e
style
13204e08
nit doc
1608a905
I hope final doc nits
99119ba2
nit
d6fb1efa
🫠
844530fd
final touch!
52be0185
fix torch import
d03de1c1
Apply suggestions from code review
c0672a8b
Apply suggestions from code review
dfc1212d
fix fix and fix
acd4ccf1
fix base model prefix!
2ddd9aad
nit
0c5d7eda
Update src/transformers/models/mamba/__init__.py
28e5ef07
Update docs/source/en/model_doc/mamba.md
f963e381
nit
095dabd6
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub