text-generation-inference
ac8c0f6f
- feat(server): flash attention past key value optimizations (#213)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
feat(server): flash attention past key value optimizations (#213)
References
#213 - feat(server): flash attention past key value optimizations
Author
njhill
Parents
274513e6
Loading