vllm
11e2375f
- [Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel (#39205)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
16 days ago
[Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel (#39205) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#39205 - [Refactor] Move MXFP8 GEMM management into MxFp8LinearKernel
Author
mgoin
Parents
fc645f1a
Loading