llama.cpp
llama : pad KV cache size
#4280
Merged

llama : pad KV cache size #4280

ggerganov merged 2 commits into master from gg/pad-kv-cache
ggerganov
slaren
ggerganov llama : pad KV cache size to 32
3e68df86
ggerganov ggerganov force pushed from 75ba5ba6 to 3e68df86 1 year ago
ggerganov metal : try to improve batched decoding
3cb1c348
ggerganov ggerganov force pushed from 10cb4593 to 3cb1c348 1 year ago
ggerganov
jhen0409
slaren
ggerganov ggerganov merged d7b800b8 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone