llama.cpp 36ddd129 - llama : add flash attention (demo)
Commit
2 years ago
llama : add flash attention (demo)
References: flash-attn
Author: ggerganov
Committer: ggerganov
Parents: 986b6ce9