vllm
3628bcaa - [ROCm][MXFP4] Infer w4a4 quant method in rocm aiter fused moe (#29775)

Commit
31 days ago
[ROCm][MXFP4] Infer w4a4 quant method in rocm aiter fused moe (#29775) Signed-off-by: ZhiweiYan-96 <zhiwei.yan@amd.com>
Author
Parents
Loading