vllm
54a66e5f
- [Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211)
Commit
89 days ago
References
#14211 - [Misc] Update `compressed-tensors` WNA16 to support zero-points
Author
dsikka
Parents
280d62b8
Files
6
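tests/quantization/test_compressed_tensors.py
vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors.py
vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_wNa16.py
vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py
vllm/model_executor/layers/quantization/kernels/mixed_precision/marlin.py
vllm/model_executor/layers/quantization/utils/marlin_utils.py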