onnxruntime
e998af75
- seq len threshold to trigger flash for packed qkv
Commit
2 years ago
seq len threshold to trigger flash for packed qkv
References
#17227 - Flash Attention v2 MHA
Author
tianleiwu
Parents
ee2296fc