onnxruntime 25c43c1f - K quant (#24615)
Commit
357 days ago
K quant (#24615)

### Description
Integrate some neural compressor code since the ORT side in the repo is in maintenance mode.

### Motivation and Context
Enable k-quant quantization.
References
#24615 - K quant
Author
jiafatom
Parents
f0d3c33d