llama.cpp
Command line switch to use F16 for memory_k and memory_v (refactor of #154)
#294
Merged


Green-Sky
Green-Sky force pushed 3 years ago
Green-Sky force pushed 3 years ago
ty-everett Use F16 for memory_k and memory_v
640b5602
Green-Sky add command line switch to use f16 instead of f32 for memory k+v
31edd6fa
Green-Sky force pushed to 31edd6fa 3 years ago
ggerganov approved these changes on 2023-03-19
ggerganov merged 0b366e73 into master 3 years ago
Green-Sky deleted the f16_memory_cli branch 3 years ago
