transformers
7f218258
- fix: infinite loop when cache is full
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
144 days ago
fix: infinite loop when cache is full * fix: use correct `layer_type` name for sliding attn * refactor: return generated request ids when calling `add_requests`
Author
McPatate
Committer
McPatate
Parents
7ece8a14
Loading