onnxruntime
5803a146 - Exclude node quantization in RTN (#23683)

Commit
359 days ago
Exclude node quantization in RTN (#23683) ### Description Exclude node quantization in RTN ### Motivation and Context We want to do mixed quantization. Tested locally with success. I originally submitted a PR https://github.com/intel/neural-compressor/pull/2114 But they suggested me to add `nodes_to_exclude` into `weight_config` when we call the API
Author
Parents
Loading