MultiheadAttention CUDA BF16 Support #26083
MHA BF16
6884d83d
Clean code
3930e267
nenad1002
marked this pull request as ready for review 100 days ago
use OrtCudaType
0891b14e
ci: trigger pipeline
3ef53ab8
tianleiwu
approved these changes
on 2025-09-25
nenad1002
merged
b7ae53f7
into main 93 days ago
nenad1002
deleted the nebanfic/mha-bf16 branch 93 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub