vllm
[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax)
#22928
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
35
Changes
View On
GitHub
[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax)
#22928
hmellor
merged 35 commits into
vllm-project:main
from
tdoublep:fp32_mamba_cache
support fp32 for mamba ssm cache
fa1bc753
rebase
a1418a9f
fix
ceccd1ec
ci: trigger vLLM pipeline
ecea362c
deal with mamba_cache_params None
bee90b27
Merge branch 'main' into fp32_mamba_cache
0835e4df
working changes
a31f5cd4
working changes
3bcff4dc
working for bamba
821a0a38
Fix type
49e7181f
Clean up
69db9b02
mergify
added
v1
gemini-code-assist
commented on 2025-08-14
minor
73d281a1
Add support for Falcon H1
f7dbfece
Add support for Zamba2
12a413b7
Add support for GraniteMoeHybrid
933d5b1e
Add support for Mamba2
6d0de8d1
Add support for mamba; linear_attention
e458090f
tomeras91
approved these changes on 2025-08-14
Support for minimax
a3b80a5c
Add support for mamba; linear_attention
94d54408
tdoublep
changed the title
[V1] [Hybrid] Support using float32 for mamba state
[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax)
269 days ago
Adding co-author.
5b005ec4
Add fp32 state test
b61b5a61
tdoublep
force pushed
to
b61b5a61
269 days ago
Use None as default
65f81404
tdoublep
marked this pull request as ready for review
269 days ago
tdoublep
requested a review
from
DarkLight1337
269 days ago
tdoublep
requested a review
from
ywang96
269 days ago
tdoublep
requested a review
from
WoosukKwon
269 days ago
tdoublep
requested a review
from
robertgshaw2-redhat
269 days ago
tdoublep
requested a review
from
njhill
269 days ago
tdoublep
requested a review
from
comaniac
269 days ago
tdoublep
requested a review
from
alexm-redhat
269 days ago
tdoublep
requested a review
from
simon-mo
269 days ago
tdoublep
requested a review
from
youkaichao
269 days ago
tdoublep
requested a review
from
mgoin
269 days ago
tdoublep
requested a review
from
tlrmchlsmth
269 days ago
tdoublep
requested a review
from
houseroad
269 days ago
tdoublep
requested a review
from
hmellor
269 days ago
tdoublep
requested a review
from
yewentao256
269 days ago
tdoublep
requested a review
from
ProExpertProg
269 days ago
Use consistent type
01ee9af8
Fix typo
bbd27827
heheda12345
commented on 2025-08-14
refactor dtype
ceb8c98a
Fix typo
fe9ebc5f
Don't pass ssm argument to minimax dtype calc
63f0656e
Simplify dtype logic
4d83dd1d
Add assert for storage_offset_bytes.
8a03a21d
heheda12345
approved these changes on 2025-08-14
heheda12345
enabled auto-merge (squash)
269 days ago
github-actions
added
ready
disabled auto-merge
269 days ago
Manually disabled by user
heheda12345
enabled auto-merge (squash)
269 days ago
Merge branch 'main' of github.com:vllm-project/vllm into fp32_mamba_c…
aaee9028
str to torch dtype
a73483ac
Merge branch 'main' into fp32_mamba_cache
1b9ee5ee
Fix V1 test
e11b4867
disabled auto-merge
268 days ago
Head branch was pushed to by a user without write access
merge in main
b5b47a2f
Merge branch 'main' into fp32_mamba_cache
1416a74e
hmellor
enabled auto-merge (squash)
268 days ago
mgoin
added this to the
v0.10.1
milestone
268 days ago
mgoin
approved these changes on 2025-08-15
hmellor
merged
75531a6c
into main
268 days ago
tdoublep
deleted the fp32_mamba_cache branch
268 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
mgoin
heheda12345
tomeras91
gemini-code-assist
DarkLight1337
ywang96
WoosukKwon
robertgshaw2-redhat
njhill
comaniac
alexm-redhat
simon-mo
youkaichao
tlrmchlsmth
houseroad
hmellor
yewentao256
ProExpertProg
Assignees
No one assigned
Labels
ready
v1
Milestone
v0.10.1
Login to write a write a comment.
Login via GitHub