transformers
3e70a207 - Static Cache: load models with MQA or GQA (#28975)

Commit
2 years ago
Static Cache: load models with MQA or GQA (#28975)
Author
Parents
Loading