onnxruntime
[CUDA] Update sm check for flash attention
#24584
Merged

[CUDA] Update sm check for flash attention #24584

tianleiwu merged 1 commit into main from tlwu/enable_flash_attn_blackwell
tianleiwu
tianleiwu loose constraint on sm for flash attention
8ecafdf7
tianleiwu tianleiwu marked this pull request as draft 245 days ago
tianleiwu tianleiwu marked this pull request as ready for review 245 days ago
tianleiwu tianleiwu requested a review from aciddelgado aciddelgado 245 days ago
tianleiwu tianleiwu requested a review from hanbitmyths hanbitmyths 245 days ago
tianleiwu tianleiwu added release:1.22.0
hanbitmyths
hanbitmyths approved these changes on 2025-04-29
baijumeswani
baijumeswani approved these changes on 2025-04-29
tianleiwu tianleiwu merged 4adef01e into main 245 days ago
tianleiwu tianleiwu deleted the tlwu/enable_flash_attn_blackwell branch 245 days ago
snnn snnn removed release:1.22.0
snnn

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone