vllm
a2c8fc66 - [ROCm][Quantization][3/N] Refactor quark_moe w4a4 w/ oracle (#41436)

Commit
8 days ago
[ROCm][Quantization][3/N] Refactor quark_moe w4a4 w/ oracle (#41436) Signed-off-by: Bowen Bao <bowenbao@amd.com>
Author
Parents
Loading