Add static quantization runner (#24114)

Commit

327 days ago

### Description

- Add a general command-line tool for static quantization
- Support loading TensorQuantOverride from a JSON file
- Add the corresponding README

### Motivation and Context

- Currently, developers are able to use the preprocess tool from the command line
  - https://onnxruntime.ai/docs/performance/model-optimizations/quantization.html#pre-processing
  - `python -m onnxruntime.quantization.preprocess --help`
- The PR aims to provide similar usage for static quantization.
  - `python -m onnxruntime.quantization.static_quantize_runner --help`
- Existing command-line examples in onnxruntime-inference-example are not general for arbitrary ONNX models.
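The description mentions loading tensor quantization overrides from a JSON file. As a rough illustration only, the sketch below reads such a file with the Python standard library and checks its shape. The layout (tensor name mapped to a list of override objects) is an assumption, as are the function name `load_tensor_quant_overrides`, the file name `overrides.json`, and the tensor name `conv1_out`. None of them come from this commit, and the runner's README is the authority on the real schema.

```python
import json
import tempfile
from pathlib import Path


def load_tensor_quant_overrides(path):
    """Read a JSON file assumed to look like {tensor_name: [{override}, ...]}."""
    with open(path, encoding="utf-8") as f:
        overrides = json.load(f)

    # Shape check only; which keys an override entry may hold is not validated here.
    if not isinstance(overrides, dict):
        raise ValueError("top-level JSON value must be an object keyed by tensor name")
    for name, entries in overrides.items():
        if not isinstance(entries, list) or not all(isinstance(e, dict) for e in entries):
            raise ValueError(f"overrides for {name!r} must be a list of objects")
    return overrides


# Usage: write a hypothetical override file, then load it back.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "overrides.json"  # hypothetical file name
    sample.write_text(
        json.dumps({"conv1_out": [{"scale": 0.05, "zero_point": 128}]}),
        encoding="utf-8",
    )
    print(load_tensor_quant_overrides(sample))
```

Validating the shape up front means a malformed file fails with a clear message before any calibration work starts.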

References

#24114 - Add static quantization runner

Author

quic-hungjuiw

Parents

4d03aeff

onnxruntime 4b9e26de - Add static quantization runner (#24114)
