onnxruntime
6265c3ac - Add 4bit quantizer to onnx runtime doc (#21835)

Commit
1 year ago
Add 4bit quantizer to onnx runtime doc (#21835) ### Description Introduce how to use matmul_4bits_quantizer to do weight only quantization. ### Motivation and Context Add 4bit quantizer to onnx runtime doc
Author
Parents
Loading