llama.cpp
a4b6341c
- wip : template for rows per warp
Commit
1 year ago
wip : template for rows per warp
References
#5021 - ggml : add Flash Attention
Author
ggerganov
Committer
ggerganov
Parents
f31955f5