onnxruntime
[CUDA] cuDNN Flash Attention
#21629
Merged

tianleiwu merged 3 commits into main from tlwu/cudnn_flash_att
tianleiwu marked this pull request as draft 1 year ago
tianleiwu force pushed from a3608a57 to 1ac4cf82 1 year ago
tianleiwu Add cudnn sdpa
9c78a6d0
tianleiwu force pushed from 93d8708e to 9c78a6d0 1 year ago
tianleiwu marked this pull request as ready for review 1 year ago
tianleiwu undo unrelated; static_cast; comments
388dabe5
tianleiwu requested a review 1 year ago
tianleiwu requested a review 1 year ago
tianleiwu force pushed from f6f82efa to 388dabe5 1 year ago
tianleiwu requested a review from wangyems 1 year ago
tianleiwu requested a review from kunal-vaishnavi 1 year ago
tianleiwu requested a review from yufenglee 1 year ago
tianleiwu Merge branch 'main' into tlwu/cudnn_flash_att
00193129
kunal-vaishnavi commented on 2024-08-19
tianleiwu requested a review from kunal-vaishnavi 1 year ago
kunal-vaishnavi approved these changes on 2024-08-20
tianleiwu merged fbc39272 into main 1 year ago
tianleiwu deleted the tlwu/cudnn_flash_att branch 1 year ago

