onnxruntime
791bbc3d
- Merge branch 'yufeng/gqa_opt' into aciddelgado/gqa_rotary
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Merge branch 'yufeng/gqa_opt' into aciddelgado/gqa_rotary
References
#18906 - GQA Rotary and Packed QKV with Flash
Author
aciddelgado
Parents
ae34d0cf
b0a1006c
Loading