onnxruntime
5803a146 - Exclude node quantization in RTN (#23683)

Commit

1 year ago

Exclude node quantization in RTN (#23683) ### Description Exclude node quantization in RTN ### Motivation and Context We want to do mixed quantization. Tested locally with success. I originally submitted a PR https://github.com/intel/neural-compressor/pull/2114 But they suggested me to add `nodes_to_exclude` into `weight_config` when we call the API

References

#23683 - Exclude node quantization in RTN

Author

jiafatom

Parents

02dc2702

onnxruntime 5803a146 - Exclude node quantization in RTN (#23683)

onnxruntime
5803a146 - Exclude node quantization in RTN (#23683)