vllm
9a2a6357 - [Bugfix] Fix FP8 Marlin MoE and enable for compressed-tensors models (#18026)

Commit
269 days ago
[Bugfix] Fix FP8 Marlin MoE and enable for compressed-tensors models (#18026)

Signed-off-by: mgoin <mgoin64@gmail.com>