llama.cpp
4b8d5e38
- llama : quantize attention results
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
llama : quantize attention results
References
quant-attn
#1103 - llama : quantize attention results
Author
ggerganov
Committer
ggerganov
Parents
10f19c11
Loading