llama.cpp
0b366e73 - Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294)

Commit
3 years ago
Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294) * Use F16 for memory_k and memory_v * add command line switch to use f16 instead of f32 for memory k+v --------- Co-authored-by: Ty Everett <ty@tyweb.us>
Author
Parents
Loading