onnxruntime
385fab5b - [QNN EP] Qnn cache improvement (#17757)

Commit
2 years ago
[QNN EP] Qnn cache improvement (#17757) ### Description Improve the QNN context binary cache feature to reduce the memory overhead and initialization time overhead. Instead of dumping a Qnn context binary file with metadata as header, we dump a Onnx format file with metadata inside Onnx node. ### Motivation and Context reduce the memory overhead and initialization time overhead
Author
Parents
Loading