onnxruntime
9a80955c - Add compute capacity to trtep engine cache file (#16356)

Commit
2 years ago
Add compute capacity to trtep engine cache file (#16356) ### Description Add "_smXX" to trtep engine cache file name, which "sm" stands for "Streaming Multiprocessor". > The GPU compute capability version is prefixed with "SM" because NVIDIA typically improves and updates the SM in each new GPU architecture. ### Motivation and Context Github issue: https://github.com/microsoft/onnxruntime/issues/15982 Reduce the chance of misusing incompatible engine cache, when user is switching GPU devices with different compute capacity * The prevention can't be 100%, as model size & GPU memory size could be another factor to make cache incompatible
Author
Parents
Loading