llama.cpp
kv-cache : use ggml_set_rows
#14285
Merged


ggerganov merged 7 commits into master from gg/kv-cache-use-set-rows
github-actions added the ggml label
Base automatically changed from gg/model-rework-out-ids to master 180 days ago
ggerganov force pushed to 5f87f289 180 days ago
ggerganov force pushed to db0cd695 179 days ago
ggerganov force pushed from a0c0fb6e 179 days ago
ggerganov force pushed to d1da9927 179 days ago
github-actions added the examples label
ggerganov force pushed to 14554a82 178 days ago
ggerganov marked this pull request as ready for review 178 days ago
ggerganov force pushed 178 days ago
ggerganov force pushed to e1aba6af 178 days ago
github-actions added the testing label
github-actions added the Apple Metal label
ggerganov force pushed from 5983eb1f 177 days ago
ggerganov force pushed 177 days ago
ggerganov force pushed 177 days ago
ggerganov force pushed 177 days ago
ggerganov force pushed to 36f8e20d 176 days ago
ggerganov marked this pull request as draft 173 days ago
ggerganov force pushed from 36f8e20d 172 days ago
ggerganov force pushed 172 days ago
ggerganov force pushed to 06bb08ac 172 days ago
ggerganov force pushed from 06bb08ac 169 days ago
ggerganov force pushed 169 days ago
ggerganov force pushed to 45341236 169 days ago
ggerganov marked this pull request as ready for review 168 days ago
ggerganov requested a review from slaren 168 days ago
slaren approved these changes on 2025-07-01
github-actions added the Vulkan label
github-actions added the SYCL label
github-actions added the Ascend NPU label
github-actions added the OpenCL label
ggerganov kv-cache : use ggml_set_rows (cd811b7a)
ggerganov graph : separate k and v indices (ac8f3474)
ggerganov cont : remove redundant ifs (2ac5be3a)
ggerganov kv-cache : improve find_slot impl (a70293bc)
ggerganov kv-cache : bounds-check when accessing slot_info indices (f3da97e6)
ggerganov kv-cache : add comments (5495ea96)
ggerganov ggml : add TODOs for adding GGML_OP_SET_ROWS support in the backends (30b4d4e1)
ggerganov force pushed to 30b4d4e1 167 days ago
ggerganov merged a70c8a0c into master 167 days ago
ggerganov deleted the gg/kv-cache-use-set-rows branch 167 days ago

