vllm
[MLA] Fuse cat and qaunt for fp8 kv-cache
#32950
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
7
Changes
View On
GitHub
[MLA] Fuse cat and qaunt for fp8 kv-cache
#32950
LucasWilkinson
merged 7 commits into
vllm-project:main
from
neuralmagic:lwilkinson/fuse-cat-and-quant
gemini-code-assist
commented on 2026-01-23
mergify
added
documentation
fuse cat and qaunt
fa5c89c0
add run_one_batch
c0a5af1c
fix
9a8e63a6
rm
b09e7097
LucasWilkinson
force pushed
to
b09e7097
21 days ago
LucasWilkinson
marked this pull request as ready for review
21 days ago
fix
5dd115bc
LucasWilkinson
requested a review
from
ProExpertProg
21 days ago
LucasWilkinson
requested a review
from
robertgshaw2-redhat
21 days ago
clean
2aba6b7d
fix
87faa76c
robertgshaw2-redhat
added
ready
robertgshaw2-redhat
approved these changes on 2026-01-23
LucasWilkinson
enabled auto-merge (squash)
21 days ago
ProExpertProg
approved these changes on 2026-01-23
mgoin
approved these changes on 2026-01-23
mgoin
added
performance
mgoin
added
deepseek
ProExpertProg
commented on 2026-01-23
LucasWilkinson
merged
da5e7b12
into main
21 days ago
robertgshaw2-redhat
deleted the lwilkinson/fuse-cat-and-quant branch
21 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
ProExpertProg
mgoin
robertgshaw2-redhat
gemini-code-assist
Assignees
No one assigned
Labels
documentation
performance
ready
deepseek
Milestone
No milestone
Login to write a write a comment.
Login via GitHub