onnxruntime
c1bf7fcd - [QNN Quant] Ensure 16bit tensor quant overrides set MS domain (#19684)

Commit
2 years ago
[QNN Quant] Ensure 16bit tensor quant overrides set MS domain (#19684) ### Description Ensures that DQ and Q ops use the msft domain if tensor quantization overrides specify 16-bit integer types. ### Motivation and Context ONNX does not yet support 16bit integer types for QuantizeLinear and DequantizeLinear ops (coming soon). For now, DQ/Q ops must use the MSFT domain. We have to also check if tensor quantization overrides force the use of 16-bit quantization types. If so, we must correctly set the domain for Q/DQ ops.
Parents
Loading