llama.cpp
0b366e73
- Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294) * Use F16 for memory_k and memory_v * add command line switch to use f16 instead of f32 for memory k+v --------- Co-authored-by: Ty Everett <ty@tyweb.us>
References
#294 - Command line switch to use F16 for memory_k and memory_v (refactor of #154)
Author
Green-Sky
Parents
160bfb21
Loading