onnxruntime 0e76730d - merge bnsh and no buff
Commit
2 years ago
merge bnsh and no buff
References
#17674 - [CUDA] GroupQueryAttention operator using FlashAttention
Author
aciddelgado
Parents
470a8a7e
6d681ee2