vllm
a2c8fc66
- [ROCm][Quantization][3/N] Refactor quark_moe w4a4 w/ oracle (#41436)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
8 days ago
[ROCm][Quantization][3/N] Refactor quark_moe w4a4 w/ oracle (#41436) Signed-off-by: Bowen Bao <bowenbao@amd.com>
References
#41436 - [ROCm][Quantization][3/N] Refactor quark_moe w4a4 w/ oracle
Author
BowenBao
Parents
6859ca76
Loading