vllm
103f0de5 - [ROCm][Quantization][1/N] Refactor quark_moe w_mxfp4 w/ oracle (#38774)

Commit
18 days ago
[ROCm][Quantization][1/N] Refactor quark_moe w_mxfp4 w/ oracle (#38774)

Signed-off-by: Bowen Bao <bowenbao@amd.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>