vllm
[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax)
#22928
Merged

tdoublep
danielafrimi support fp32 for mamba ssm cache
fa1bc753
danielafrimi rebase
a1418a9f
danielafrimi fix
ceccd1ec
danielafrimi ci: trigger vLLM pipeline
ecea362c
danielafrimi deal with mamba_cache_params None
bee90b27
tdoublep Merge branch 'main' into fp32_mamba_cache
0835e4df
tdoublep working changes
a31f5cd4
tdoublep working changes
3bcff4dc
tdoublep working for bamba
821a0a38
tdoublep Fix type
49e7181f
tdoublep Clean up
69db9b02
github-actions
mergify added v1
gemini-code-assist commented on 2025-08-14
tdoublep minor
73d281a1
tdoublep Add support for Falcon H1
f7dbfece
tdoublep Add support for Zamba2
12a413b7
tdoublep Add support for GraniteMoeHybrid
933d5b1e
tdoublep Add support for Mamba2
6d0de8d1
tdoublep Add support for mamba; linear_attention
e458090f
tomeras91 approved these changes on 2025-08-14
tdoublep Support for minimax
a3b80a5c
tdoublep Add support for mamba; linear_attention
94d54408
tdoublep changed the title from "[V1] [Hybrid] Support using float32 for mamba state" to "[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax)" 269 days ago
tdoublep Adding co-author.
5b005ec4
tdoublep Add fp32 state test
b61b5a61
tdoublep force-pushed to b61b5a61 269 days ago
tdoublep Use None as default
65f81404
tdoublep marked this pull request as ready for review 269 days ago
tdoublep requested a review from DarkLight1337 269 days ago
tdoublep requested a review from ywang96 269 days ago
tdoublep requested a review from WoosukKwon 269 days ago
tdoublep requested a review from robertgshaw2-redhat 269 days ago
tdoublep requested a review from njhill 269 days ago
tdoublep requested a review from comaniac 269 days ago
tdoublep requested a review from alexm-redhat 269 days ago
tdoublep requested a review from simon-mo 269 days ago
tdoublep requested a review from youkaichao 269 days ago
tdoublep requested a review from mgoin 269 days ago
tdoublep requested a review from tlrmchlsmth 269 days ago
tdoublep requested a review from houseroad 269 days ago
tdoublep requested a review from hmellor 269 days ago
tdoublep requested a review from yewentao256 269 days ago
tdoublep requested a review from ProExpertProg 269 days ago
tdoublep Use consistent type
01ee9af8
tdoublep Fix typo
bbd27827
heheda12345 commented on 2025-08-14
tdoublep refactor dtype
ceb8c98a
tdoublep Fix typo
fe9ebc5f
tdoublep Don't pass ssm argument to minimax dtype calc
63f0656e
tdoublep
tdoublep Simplify dtype logic
4d83dd1d
tdoublep Add assert for storage_offset_bytes.
8a03a21d
heheda12345 approved these changes on 2025-08-14
heheda12345 enabled auto-merge (squash) 269 days ago
github-actions added ready
disabled auto-merge 269 days ago
Manually disabled by user
heheda12345 enabled auto-merge (squash) 269 days ago
heheda12345 Merge branch 'main' of github.com:vllm-project/vllm into fp32_mamba_c…
aaee9028
heheda12345 str to torch dtype
a73483ac
tdoublep Merge branch 'main' into fp32_mamba_cache
1b9ee5ee
tdoublep Fix V1 test
e11b4867
disabled auto-merge 268 days ago
Head branch was pushed to by a user without write access
tdoublep merge in main
b5b47a2f
tdoublep Merge branch 'main' into fp32_mamba_cache
1416a74e
hmellor enabled auto-merge (squash) 268 days ago
mgoin added this to the v0.10.1 milestone 268 days ago
mgoin approved these changes on 2025-08-15
hmellor merged 75531a6c into main 268 days ago
tdoublep deleted the fp32_mamba_cache branch 268 days ago
cyang49
tdoublep
cyang49
