vllm
4d251ad0 - Fix CompressedTensorsWNA16MoE with grouped scales (#13769)

Comment changes are shownComment changes are hidden
Commit
168 days ago
Fix CompressedTensorsWNA16MoE with grouped scales (#13769)
Author
Parents
  • vllm/model_executor/layers/quantization/compressed_tensors
    • File
      compressed_tensors_moe.py
Loading