onnxruntime
7e092a7e - Reduce number of memory allocations based on a customer profiling case (#10193)

Commit

4 years ago

Reduce number of memory allocations based on a customer profiling case (#10193) Add abseil and inlined containers typedefs Introduce TensorShapeVector for shape building. Use gsl::span<const T> to make interfaces accept different types of vector like args. Introduce InineShapeVectorT for shape capacity typed instantiations Refactor cuda slice along with provider shared interfaces Refactor Concat, Conv, Pad Build with Conv Einsum and ConvTranspose refactored. Remove TesnorShape::GetDimsAsVector() Refactor SliceIterator and SliceIteratorBase Refactor broadcast Refactor Pads for twice as long Remove memory planner intermediate shapes vector Refactor orttraining Fix passing TenshroShapeVector to tests Remove abseil copy and submodule, use FetchContent_Declare/Fetch Path with separate command Make RocmAsyncBuffer accept anything convertible to span. Adjust Linux GPU pipeline.

References

#10193 - Reduce number of memory allocations based on a customer profiling case

Author

yuslepukhin

Parents

5df15c56

onnxruntime 7e092a7e - Reduce number of memory allocations based on a customer profiling case (#10193)

onnxruntime
7e092a7e - Reduce number of memory allocations based on a customer profiling case (#10193)