vllm
4d251ad0
- Fix CompressedTensorsWNA16MoE with grouped scales (#13769)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
168 days ago
Fix CompressedTensorsWNA16MoE with grouped scales (#13769)
References
#13769 - Fix CompressedTensorsWNA16MoE with grouped scales
Author
mgoin
Parents
18e50593
Files
1
vllm/model_executor/layers/quantization/compressed_tensors
compressed_tensors_moe.py
Loading