llama : pad KV cache size #4280
llama : pad KV cache size to 32
3e68df86
ggerganov
force pushed
to
3e68df86
2 years ago
metal : try to improve batched decoding
3cb1c348
ggerganov
force pushed
to
3cb1c348
2 years ago
ggerganov
merged
d7b800b8
into master 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub