onnxruntime
Share TunableOp between CUDA and ROCM EP
#13560
Merged

cloudhan merged 9 commits into main from guangyunhan/shared-tunable
cloudhan requested a review from zhangyaobit 3 years ago
cloudhan Move code to preserve the editing history, also make rocm buildable a…
9888fb2b
cloudhan Make compilable and runnable with cuda and rocm
98cfa535
cloudhan Add tests for TunableOp
fcda6027
cloudhan Fix windows tests sleep timing resolution
74deabe7
cloudhan Disallow TunableOp to run if RTTI is disabled
454ea14c
cloudhan force pushed from 9ddcde6a to ab4fd6a5 3 years ago
cloudhan force pushed from ab4fd6a5 to 454ea14c 3 years ago
abudup commented on 2022-11-09
cloudhan Fix typos
f0c5fd8d
cloudhan Address review
a1758f02
cloudhan Also move gemm related registration files
5f6717a4
cloudhan Minor fix
e0e80de7
cloudhan requested a review from abudup 3 years ago
abudup approved these changes on 2022-11-10
zhangyaobit approved these changes on 2022-11-11
cloudhan merged 369a8224 into main 3 years ago
cloudhan deleted the guangyunhan/shared-tunable branch 3 years ago