transformers
3e70a207
- Static Cache: load models with MQA or GQA (#28975)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
Static Cache: load models with MQA or GQA (#28975)
References
#28975 - Static Cache: load models with MQA or GQA
Author
gante
Parents
da20209d
Loading