vllm
[Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel
#39205
Merged

[Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel #39205

mgoin
mgoin [Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel
f9e8cba3
mgoin mgoin requested a review from robertgshaw2-redhat robertgshaw2-redhat 35 days ago
mgoin mgoin requested a review from tlrmchlsmth tlrmchlsmth 35 days ago
mgoin mgoin requested a review from yewentao256 yewentao256 35 days ago
mgoin mgoin requested a review from pavanimajety pavanimajety 35 days ago
mgoin mgoin removed review request from tlrmchlsmth tlrmchlsmth 35 days ago
mgoin mgoin removed review request from pavanimajety pavanimajety 35 days ago
mgoin mgoin removed review request from yewentao256 yewentao256 35 days ago
mgoin mgoin added ready
mgoin mgoin added nvidia
mgoin mgoin added quantization
gemini-code-assist
gemini-code-assist commented on 2026-04-07
mergify
mergify mergify added needs-rebase
mgoin Merge origin/main into mxfp8-linear-kernel, resolving conflicts
16245f7e
mergify mergify removed needs-rebase
mgoin Merge branch 'main' into mxfp8-linear-kernel
2f7e18ca
mergify
mergify mergify added needs-rebase
mgoin Merge branch 'main' into mxfp8-linear-kernel
5e4dacbd
mgoin Fix Mxfp8OnlineLinearMethod AttributeError on activation_quant_key
e0796502
mergify mergify removed needs-rebase
vllm-bot vllm-bot merged 11e2375f into main 32 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone