vllm
4896d0c2
- [Quant] Fix use_mla TypeError and support loading pure-sparsity Compressed Tensors configs (#12711)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
314 days ago
[Quant] Fix use_mla TypeError and support loading pure-sparsity Compressed Tensors configs (#12711)
References
#12711 - [Quant] Fix use_mla TypeError and support loading pure-sparsity Compressed Tensors configs
Author
kylesayrs
Parents
bb392af4
Loading