onnxruntime
1e78bcea
- Implement CUDA IsInf-10,20 (#19772)
Commit
1 year ago
Implement CUDA IsInf-10,20 (#19772)

### Description
Implement IsInf-10,20 for CUDA. Also add FP16 types on CPU.

### Motivation and Context
Certain models lag in performance because IsInf is not available on CUDA.
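For readers unfamiliar with the operator, here is a minimal reference sketch of the element-wise semantics of ONNX IsInf, written in plain Python. It is not the CUDA kernel from this commit. The function name `is_inf` and the flat-list input are illustrative assumptions. The `detect_positive` and `detect_negative` flags mirror the attributes of the same names in the ONNX operator definition, both of which default to enabled.

```python
import math


def is_inf(values, detect_positive=True, detect_negative=True):
    """Element-wise IsInf over a flat list of floats (hypothetical helper)."""
    result = []
    for v in values:
        if math.isinf(v):
            # The sign of the infinity decides which flag governs the element.
            flag = detect_positive if v > 0 else detect_negative
            result.append(bool(flag))
        else:
            # Finite values and NaN are never reported as infinite.
            result.append(False)
    return result


sample = [1.0, float("inf"), float("-inf"), float("nan"), 0.0]
print(is_inf(sample))
print(is_inf(sample, detect_negative=False))
```

A GPU implementation would apply the same per-element test in parallel, one thread per element, which is why the operator maps naturally onto CUDA.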
References
wangye/cuda_graph_2
#19772 - Implement CUDA IsInf-10,20
Author
yuslepukhin
Parents
06e684c9