transformers
Add the Bamba Model
#34982
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
44
Changes
View On
GitHub
Add the Bamba Model
#34982
molbap
merged 44 commits into
huggingface:main
from
fabianlim:bamba-pr
fabianlim
marked this pull request as draft
1 year ago
fabianlim
changed the title
initial commit for PR
Add the Bamba Model
1 year ago
initial commit for PR
e2991328
fabianlim
force pushed
to
e2991328
1 year ago
rename dynamic cache
c1d2d2b4
Rocketknight1
assigned
molbap
1 year ago
add more unit tests
7c87f858
add integration test
2897866a
add integration test
5671778d
Add modular bamba file
947b877f
Merge branch 'bamba-pr' of https://github.com/fabianlim/transformers …
78c9b04b
Merge branch 'main' into bamba-pr
3e352f72
Remove trainer changes from unrelated PR
2c215721
Modify modular and cofig to get model running
a0e58b4c
molbap
added
State space models
molbap
added
New model
Fix some CI errors and beam search
a450f1c8
Fix a plethora of bugs from CI/docs/etc
146a940c
Add bamba to models with special caches
1144bbbd
Updat to newer mamba PR for mamba sublayer
856cb3a4
fix test_left_padding_compatibility
9ec6d15d
Merge remote-tracking branch 'upstream/main' into bamba-pr
d7875bef
fix style
895521df
fix remaining tests
a394a7d6
missed this test
f0604de7
ran make style
92ad669d
molbap
added
run-slow
move slow tag to integration obj
36370f32
make style
ecd7aff8
molbap
commented on 2024-12-16
address comments
94a13d6b
fix modular
4cff28c8
left out one part of modular
5d9ce5c5
change model
b934261c
fabianlim
marked this pull request as ready for review
1 year ago
Make Rotary modular as well
024072ac
Merge branch 'main' into bamba-pr
f7ceb0cb
molbap
approved these changes on 2024-12-16
Update bamba.md
0e97747f
Update bamba.md
53b8acfc
Update bamba.md
a8fa7ff6
Update bamba.md
0115bf6c
Merge pull request #4 from divya-kumari32/patch-1
44788dce
Add docs for config and model back
537964b4
Merge branch 'main' into bamba-pr
ab261616
Add warning when using fast kernels
ddd61182
replaced generate example
10887780
pcuenca
requested a review
from
ArthurZucker
1 year ago
ArthurZucker
approved these changes on 2024-12-18
Address comments from PR
d4c650ca
Merge branch 'bamba-pr' of https://github.com/fabianlim/transformers …
6f28b96f
Merge branch 'main' into bamba-pr
1c82bf06
Propagate attention fixes
9911cdf1
Fix attention interfaces to the new API
c7b50e60
Fix API for decoder layer
e0f34f57
Remove extra weights
bdd32720
molbap
merged
9613933b
into main
1 year ago
Login to write a write a comment.
Login via GitHub
Reviewers
ArthurZucker
molbap
ani300
Assignees
molbap
Labels
New model
run-slow
State space models
Milestone
No milestone
Login to write a write a comment.
Login via GitHub