onnxruntime
69ab4670
- CUDA UpsampleNearest performance improvement (#7592)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
CUDA UpsampleNearest performance improvement (#7592) * Made rank a template parameter of _UpampleNearestKernel * Added error checking for rank specified to UpampleImpl * Added __restrict__ keyboard to input and output arrays in Upsample
References
#7592 - CUDA UpsampleNearest performance improvement
Author
cschreib-ibex
Parents
129722db
Loading