vllm
3628bcaa
- [ROCm][MXFP4] Infer w4a4 quant method in rocm aiter fused moe (#29775)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
30 days ago
[ROCm][MXFP4] Infer w4a4 quant method in rocm aiter fused moe (#29775) Signed-off-by: ZhiweiYan-96 <zhiwei.yan@amd.com>
References
#29775 - [ROCm][MXFP4] Infer w4a4 quant method in rocm aiter fused moe
Author
ZhiweiYan-96
Parents
b73b158a
Loading