onnxruntime 6e6ad2cb - rotary fully implemented
Commit
2 years ago
rotary fully implemented
References
aciddelgado/gqa_rotary
#18906 - GQA Rotary and Packed QKV with Flash
Author
aciddelgado
Parents
791bbc3d