vllm
[AMD][FP8] Using MI300 FP8 format on ROCm for block_quant
#12134
Merged

gshtras
gshtras Requantizing fp8 weights into NANOO format on rocm platform. Conditio…
1d54e3cb
github-actions
mgoin
mgoin approved these changes on 2025-01-17
mgoin added the ready label
gshtras
mgoin enabled auto-merge (squash) 324 days ago
mgoin merged b5b57e30 into main 324 days ago
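
For readers unfamiliar with the two FP8 encodings, here is a minimal pure-Python sketch of what requantizing an OCP `e4m3fn` byte into the MI300-style format can look like. It assumes that the NANOO format named in the commit message corresponds to the `e4m3fnuz` encoding (exponent bias 8, a single NaN at bit pattern `0x80`, no negative zero). The helper names `decode_e4m3fn`, `decode_e4m3fnuz`, and `requantize` are hypothetical and are not taken from this PR. The sketch relies on one property: the same bit pattern decodes to half the value under `e4m3fnuz`, so keeping the bits and doubling the scale preserves the dequantized weight.

```python
import math


def decode_e4m3fn(byte: int) -> float:
    """Decode one OCP e4m3fn byte (exponent bias 7, NaN when all of S.1111.111 value bits are set)."""
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0xF
    man = byte & 0x7
    if exp == 0xF and man == 0x7:
        return math.nan
    if exp == 0:  # subnormal
        return sign * (man / 8) * 2.0 ** (1 - 7)
    return sign * (1 + man / 8) * 2.0 ** (exp - 7)


def decode_e4m3fnuz(byte: int) -> float:
    """Decode one e4m3fnuz byte (exponent bias 8, 0x80 is the only NaN)."""
    if byte == 0x80:
        return math.nan
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0xF
    man = byte & 0x7
    if exp == 0:  # subnormal
        return sign * (man / 8) * 2.0 ** (1 - 8)
    return sign * (1 + man / 8) * 2.0 ** (exp - 8)


def requantize(byte: int, scale: float) -> tuple[int, float]:
    """Hypothetical helper: reinterpret an e4m3fn byte as e4m3fnuz.

    0x80 is negative zero in e4m3fn but NaN in e4m3fnuz, so it is mapped to
    0x00. The scale is doubled because identical bits decode to half the value.
    """
    new_byte = 0x00 if byte == 0x80 else byte
    return new_byte, scale * 2.0


# The dequantized value survives the round trip for every non-NaN input byte.
scale = 0.5
for b in range(256):
    if b in (0x7F, 0xFF):  # NaN encodings in e4m3fn, skipped here
        continue
    nb, ns = requantize(b, scale)
    assert decode_e4m3fn(b) * scale == decode_e4m3fnuz(nb) * ns
```

A tensor-level version would apply the same two steps to a whole weight tensor and its block-wise scales; the per-byte form is used here only to make the arithmetic easy to check.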
