vllm
[Bugfix][Perf] Indexer upcast WK to BF16 for fusion
#38928
Merged

[Bugfix][Perf] Indexer upcast WK to BF16 for fusion #38928

mgoin merged 5 commits into vllm-project:main from CentML:fix-wk-fusion-fp8
benchislett
benchislett upcast wk to bf16 to enable fusion
5d8d7661
benchislett benchislett requested a review from zyongye zyongye 63 days ago
benchislett benchislett requested a review from robertgshaw2-redhat robertgshaw2-redhat 63 days ago
benchislett fix mtp weight load
f8a56327
benchislett benchislett requested a review from luccafong luccafong 63 days ago
mergify mergify added deepseek
mergify mergify added bug
zyongye
gemini-code-assist
gemini-code-assist commented on 2026-04-03
benchislett
zyongye
mergify
mergify mergify added needs-rebase
benchislett
zyongye
benchislett
mgoin
mgoin approved these changes on 2026-04-14
mgoin mgoin added performance
mgoin mgoin added ready
benchislett Merge branch 'main' into fix-wk-fusion-fp8
7a226b37
benchislett remove fp4/fp8 distinction
4184336b
benchislett benchislett requested a review from LucasWilkinson LucasWilkinson 51 days ago
benchislett benchislett requested a review from MatthewBonanni MatthewBonanni 51 days ago
benchislett
benchislett remove irrelevant diff
481ff20d
benchislett
mgoin mgoin enabled auto-merge (squash) 51 days ago
mergify mergify removed needs-rebase
mgoin mgoin merged ac3dac54 into main 51 days ago
benchislett benchislett deleted the fix-wk-fusion-fp8 branch 50 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone