text-generation-inference
e5618d6e
- add chunked attn support
Commit
255 days ago
add chunked attn support
References: chunked_attn_l4
Author: pcuenca
Parents: 5861da1a