vllm
103f0de5
- [ROCm][Quantization][1/N] Refactor quark_moe w_mxfp4 w/ oracle (#38774)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
18 days ago
[ROCm][Quantization][1/N] Refactor quark_moe w_mxfp4 w/ oracle (#38774) Signed-off-by: Bowen Bao <bowenbao@amd.com> Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
References
#38774 - [ROCm][Quantization][1/N] Refactor quark_moe w_mxfp4 w/ oracle
Author
BowenBao
Parents
32e0c0bf
Loading