vllm
[Bugfix][Perf] Indexer upcast WK to BF16 for fusion
#38928
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
5
Changes
View On
GitHub
[Bugfix][Perf] Indexer upcast WK to BF16 for fusion
#38928
mgoin
merged 5 commits into
vllm-project:main
from
CentML:fix-wk-fusion-fp8
upcast wk to bf16 to enable fusion
5d8d7661
benchislett
requested a review
from
zyongye
63 days ago
benchislett
requested a review
from
robertgshaw2-redhat
63 days ago
fix mtp weight load
f8a56327
benchislett
requested a review
from
luccafong
63 days ago
mergify
added
deepseek
mergify
added
bug
gemini-code-assist
commented on 2026-04-03
mergify
added
needs-rebase
mgoin
approved these changes on 2026-04-14
mgoin
added
performance
mgoin
added
ready
Merge branch 'main' into fix-wk-fusion-fp8
7a226b37
remove fp4/fp8 distinction
4184336b
benchislett
requested a review
from
LucasWilkinson
51 days ago
benchislett
requested a review
from
MatthewBonanni
51 days ago
remove irrelevant diff
481ff20d
mgoin
enabled auto-merge (squash)
51 days ago
mergify
removed
needs-rebase
mgoin
merged
ac3dac54
into main
51 days ago
benchislett
deleted the fix-wk-fusion-fp8 branch
50 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
mgoin
gemini-code-assist
zyongye
robertgshaw2-redhat
luccafong
LucasWilkinson
MatthewBonanni
Assignees
No one assigned
Labels
bug
performance
ready
deepseek
Milestone
No milestone
Login to write a write a comment.
Login via GitHub