vllm
df1e30e7 - [Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)

Commit

21 days ago

[Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)

Signed-off-by: EdalatiAli <aliedalati@cohere.com>

Author

EdalatiAli

Parents