onnxruntime
6265c3ac
- Add 4bit quantizer to onnx runtime doc (#21835)
Commit
1 year ago
Add 4bit quantizer to onnx runtime doc (#21835)

### Description
Introduce how to use matmul_4bits_quantizer to do weight only quantization.

### Motivation and Context
Add 4bit quantizer to onnx runtime doc
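For readers unfamiliar with the term, the idea behind block-wise 4-bit weight-only quantization can be sketched in plain Python. This is an illustrative sketch only, not the code of matmul_4bits_quantizer: the function names, the asymmetric scheme, and the block size used here are assumptions made for the example. Each block of weights gets its own scale and zero point, and every weight is stored as an integer code in the range 0 to 15.

```python
from typing import List, Tuple

# Hypothetical helper names; this is not the onnxruntime API.
QuantBlock = Tuple[List[int], float, int]  # (4-bit codes, scale, zero_point)


def quantize_block_4bit(block: List[float]) -> QuantBlock:
    """Asymmetric 4-bit quantization of one block of weights."""
    # Extend the range to include 0.0 so that zero stays representable.
    lo = min(min(block), 0.0)
    hi = max(max(block), 0.0)
    # 4 bits give 16 levels, i.e. 15 steps; fall back to 1.0 for an all-zero block.
    scale = (hi - lo) / 15 or 1.0
    zero_point = max(0, min(15, round(-lo / scale)))
    codes = [max(0, min(15, round(w / scale) + zero_point)) for w in block]
    return codes, scale, zero_point


def dequantize_block_4bit(codes: List[int], scale: float, zero_point: int) -> List[float]:
    """Map 4-bit codes back to approximate floating-point weights."""
    return [(q - zero_point) * scale for q in codes]


def quantize_weights(weights: List[float], block_size: int = 32) -> List[QuantBlock]:
    """Quantize a flat weight list block by block (block_size is an assumed default)."""
    return [
        quantize_block_4bit(weights[i:i + block_size])
        for i in range(0, len(weights), block_size)
    ]


if __name__ == "__main__":
    weights = [-1.0, -0.5, 0.0, 0.5, 1.0, 0.25, -0.75, 0.1]
    for codes, scale, zp in quantize_weights(weights, block_size=4):
        print(codes, scale, zp, dequantize_block_4bit(codes, scale, zp))
```

Only the weights are quantized in this scheme; activations stay in floating point, which is what "weight only" refers to. Smaller blocks track the local weight range more closely at the cost of storing more scales and zero points.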
References
#21835 - Add 4bit quantizer to onnx runtime doc
#24621 - Add GenAI Chat Template Function Docs
#26165 - [WIP] Modifications to Card Component Based on User Feedback
Author
fajin-corp
Parents
d491241b