transformers
7f218258 - fix: infinite loop when cache is full

Commit
144 days ago
fix: infinite loop when cache is full * fix: use correct `layer_type` name for sliding attn * refactor: return generated request ids when calling `add_requests`
Author
Committer
Parents
Loading