onnxruntime
4f2bd386 - [QNN EP] Ensure QDQ Split input/output quant params are equal (#18332)

Commit
2 years ago
[QNN EP] Ensure QDQ Split input/output quant params are equal (#18332) ### Description Updates QNN EP to force Split operators to use the same quant params for all input/outputs (only if they were already nearly equal). This can be necessary for the sequence Sigmoid -> Split because QNN requires Sigmoid ops to override output quant params to specific values. Also did the same for the following operators that do not change input data: - Expand - Gather - MaxPool - Reshape/Flatten/Squeeze/Unsqueeze - Resize - Split - Tile ### Motivation and Context The QNN HTP backend employs certain optimizations when all the quantization parameters for the Split operator are equal. We need to ensure they are equal to get better inference latency performance. --------- Signed-off-by: adrianlizarraga <adlizarraga@microsoft.com>
Parents
Loading