onnxruntime
5fb30743
- implement split kv attention and multihead attention
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
implement split kv attention and multihead attention
References
aciddelgado/split_kv
Author
aciddelgado
Parents
cdc65dcc
Loading