vllm
54a66e5f - [Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211)

Comment changes are shownComment changes are hidden
Commit
89 days ago
[Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211)
Author
Parents
  • tests/quantization
    • File
      test_compressed_tensors.py
  • vllm/model_executor/layers/quantization
    • compressed_tensors
      • File
        compressed_tensors.py
      • schemes
        • File
          compressed_tensors_wNa16.py
    • kernels/mixed_precision
      • File
        machete.py
      • File
        marlin.py
    • utils
      • File
        marlin_utils.py