vllm
b5b57e30 - [AMD][FP8] Using MI300 FP8 format on ROCm for block_quant (#12134)

Commit
328 days ago
[AMD][FP8] Using MI300 FP8 format on ROCm for block_quant (#12134)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>