vllm
[MLA] Fuse cat and quant for fp8 kv-cache
#32950
Merged

LucasWilkinson
gemini-code-assist commented on 2026-01-23
mergify added documentation
LucasWilkinson fuse cat and quant
fa5c89c0
LucasWilkinson add run_one_batch
c0a5af1c
LucasWilkinson fix
9a8e63a6
LucasWilkinson rm
b09e7097
LucasWilkinson force pushed to b09e7097 21 days ago
LucasWilkinson marked this pull request as ready for review 21 days ago
LucasWilkinson fix
5dd115bc
LucasWilkinson requested a review from ProExpertProg 21 days ago
LucasWilkinson requested a review from robertgshaw2-redhat 21 days ago
LucasWilkinson clean
2aba6b7d
LucasWilkinson fix
87faa76c
robertgshaw2-redhat added ready
robertgshaw2-redhat approved these changes on 2026-01-23
LucasWilkinson enabled auto-merge (squash) 21 days ago
ProExpertProg approved these changes on 2026-01-23
mgoin approved these changes on 2026-01-23
mgoin added performance
mgoin added deepseek
ProExpertProg commented on 2026-01-23
LucasWilkinson merged da5e7b12 into main 21 days ago
robertgshaw2-redhat deleted the lwilkinson/fuse-cat-and-quant branch 21 days ago