onnxruntime
801006d8 - [QNN-EP] Define SpaceToDepth fusion for YOLOv2. (#24848)

Commit

1 year ago

[QNN-EP] Define SpaceToDepth fusion for YOLOv2. (#24848)

### Description

- Add SpaceToDepth fusion to the QNN preprocess.
- The pattern in YOLOv2 is uncommon; the commonly seen one is left as future work.
- Add an entry point/API that lets non-quantization users preprocess models for QNN execution.
- Revise CMake to package the newly introduced directory into the Python wheel.

### Motivation and Context

- When the YOLOv2 model is executed on QNN-EP, a sequence of Reshape and Transpose ops with 6D shapes falls back to CPU due to an HTP limitation. The fusion replaces this sequence with a single SpaceToDepth, which can be executed directly on QNN-EP.
- The current QNN preprocess lives in `onnxruntime/python/tools/quantization/execution_providers/qnn/preprocess.py`, under the quantization directory, so the path may be confusing for non-quantization users. To let them preprocess models for QNN, introduce `onnxruntime/python/tools/qnn/preprocess.py` as the entry point and API for preprocessing models.
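To illustrate why a Reshape, Transpose, Reshape chain with a 6D intermediate can be collapsed into one SpaceToDepth, here is a minimal pure-Python sketch. It uses the decomposition given in the ONNX operator specification (6D shape `[N, C, H/bs, bs, W/bs, bs]`, permutation `[0, 3, 5, 1, 2, 4]`); this is presumably the "commonly seen" pattern, and the exact YOLOv2 variant matched by this commit is not reproduced here. The helper names `transpose_flat`, `space_to_depth_via_reshape`, and `space_to_depth_direct` are hypothetical and are not part of onnxruntime.

```python
from itertools import product


def transpose_flat(data, shape, perm):
    """Transpose a row-major flat list of the given shape by axis permutation."""
    # Row-major strides of the input shape.
    strides = [1] * len(shape)
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    out_shape = [shape[p] for p in perm]
    out = []
    for idx in product(*(range(d) for d in out_shape)):
        # Output axis k reads from input axis perm[k].
        src = sum(idx[k] * strides[perm[k]] for k in range(len(perm)))
        out.append(data[src])
    return out, out_shape


def space_to_depth_via_reshape(data, n, c, h, w, bs):
    """Reshape (6D) -> Transpose -> Reshape (4D), as in the ONNX spec."""
    # Reshape is a no-op on row-major flat data; only the shape changes.
    shape6 = [n, c, h // bs, bs, w // bs, bs]
    out, _ = transpose_flat(data, shape6, [0, 3, 5, 1, 2, 4])
    return out, [n, c * bs * bs, h // bs, w // bs]


def space_to_depth_direct(data, n, c, h, w, bs):
    """Single-op reference: gather each output element by index arithmetic."""
    oh, ow = h // bs, w // bs
    out = []
    for b, oc, y, x in product(range(n), range(c * bs * bs), range(oh), range(ow)):
        # Output channel layout: (dy * bs + dx) * c + input_channel.
        block, ic = divmod(oc, c)
        dy, dx = divmod(block, bs)
        src = ((b * c + ic) * h + (y * bs + dy)) * w + (x * bs + dx)
        out.append(data[src])
    return out, [n, c * bs * bs, oh, ow]


if __name__ == "__main__":
    n, c, h, w, bs = 1, 2, 4, 4, 2
    data = list(range(n * c * h * w))
    fused = space_to_depth_direct(data, n, c, h, w, bs)
    chain = space_to_depth_via_reshape(data, n, c, h, w, bs)
    print("equivalent:", fused == chain)
```

Because both routes produce the same output, a graph pass that recognizes the three-op chain can substitute a single SpaceToDepth node with `blocksize = bs`, which avoids the 6D intermediate tensor entirely.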

References

#24848 - [QNN-EP] Define SpaceToDepth fusion for YOLOv2.

Author

minfhong-quic

Parents

9349c370
