vllm
df1e30e7
- [Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)
Commit
21 days ago
[Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers (#38815)

Signed-off-by: EdalatiAli <aliedalati@cohere.com>
References
#38815 - [Quant] add CompressedTensorsW8A8Mxfp8 for linear and MoE layers
Author
EdalatiAli
Parents
bd8bd523