onnxruntime
[CUDA] upgrade cutlass to 3.5.0
#20940
Merged

[CUDA] upgrade cutlass to 3.5.0 #20940

tianleiwu merged 13 commits into main from tlwu/fix_cutlass_msvc_build_error
tianleiwu
tianleiwu Add /Zc:__cplusplus
dd964cb0
tianleiwu update cutlass
e31edf74
tianleiwu Add code to use batch hook
4ee7731a
tianleiwu update cgmanifest
94a3b2e9
tianleiwu tianleiwu requested a review 1 year ago
tianleiwu tianleiwu requested a review 1 year ago
tianleiwu tianleiwu marked this pull request as draft 1 year ago
snnn
snnn commented on 2024-06-05
tianleiwu limit max head size = 1024
fd89bb95
yufenglee yufenglee requested a review from aciddelgado aciddelgado 1 year ago
tianleiwu fix linux build
be8f3c65
tianleiwu use GQAToBatchHook
02886c2e
tianleiwu cutlass patch to fix hrsqrt not found for SM<53
0f460d1d
tianleiwu suppress TRT deprecated warnings
ed63a78c
tianleiwu undo to_batch_hook
274862be
tianleiwu suppress trt deprecate warning and clean up
8d5c4c08
tianleiwu tianleiwu marked this pull request as ready for review 1 year ago
tianleiwu tianleiwu added release:1.18.1
tianleiwu
tianleiwu commented on 2024-06-07
tianleiwu tianleiwu requested a review from wangyems wangyems 1 year ago
wangyems
wangyems commented on 2024-06-10
wangyems
wangyems commented on 2024-06-10
snnn
snnn dismissed these changes on 2024-06-10
tianleiwu address review feedback
a02c612d
snnn snnn dismissed their stale review 1 year ago
Thanks for the fix!
tianleiwu fix more 4996 warnings
415c5e10
tianleiwu tianleiwu requested a review from wangyems wangyems 1 year ago
tianleiwu tianleiwu requested a review from chilo-ms chilo-ms 1 year ago
tianleiwu tianleiwu force pushed from 6adb2cdd to 415c5e10 1 year ago
wangyems
wangyems approved these changes on 2024-06-11
snnn
snnn approved these changes on 2024-06-11
tianleiwu tianleiwu requested a review from pranavsharma pranavsharma 1 year ago
faxu
faxu approved these changes on 2024-06-11
tianleiwu tianleiwu merged b3fc9b5a into main 1 year ago
tianleiwu tianleiwu deleted the tlwu/fix_cutlass_msvc_build_error branch 1 year ago
sophies927 sophies927 added triage:approved
jywu-msft jywu-msft removed triage:approved
jywu-msft jywu-msft removed release:1.18.1
jywu-msft jywu-msft added ep:CUDA
jywu-msft jywu-msft added ep:TensorRT

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone