onnxruntime
ba8a00f6 - [TVM EP] Support zero copying TVM EP output tensor to ONNX Runtime output tensor (#12593)

Commit
2 years ago
[TVM EP] Support zero copying TVM EP output tensor to ONNX Runtime output tensor (#12593) **Description**: Support new feature of TVM Virtual Machine (method `set_outputs`) on TVM Execution Provider side. It allows to avoid excess copying from TVM EP output tensor to ONNX Runtime one **Motivation and Context** Tests with multiple output topologies and big output tensors shows that there is overheads spent on copying from TVM EP to ONNX Runtime. Returning output(s) on preallocated memory for VirtualMachine was implemented on TVM side. **Details** `set_output_zero_copy` provider option for TVM EP switches on/off this feature. It is true by default. The feature works for both GraphExecutor and VirtualMachine from TVM. --------- Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>
Author
Parents
Loading