Command line switch to use F16 for memory_k and memory_v (refactor of #154) #294
Use F16 for memory_k and memory_v
640b5602
add command line switch to use f16 instead of f32 for memory k+v
31edd6fa
Green-Sky force pushed to 31edd6fa 3 years ago
ggerganov approved these changes on 2023-03-19
ggerganov merged 0b366e73 into master 3 years ago
Green-Sky deleted the f16_memory_cli branch 3 years ago