vllm
df1e30e7 - [Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)

Commit
21 days ago
[Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)

Signed-off-by: EdalatiAli <aliedalati@cohere.com>