llama.cpp
kv-cache : use ggml_set_rows
#14285
Merged

kv-cache : use ggml_set_rows #14285

ggerganov merged 7 commits into master from gg/kv-cache-use-set-rows
ggerganov
github-actions github-actions added ggml
Base automatically changed from gg/model-rework-out-ids to master 88 days ago
rgerganov
ggerganov ggerganov force pushed from 8f1c5e3f to 5f87f289 88 days ago
ggerganov
ggerganov
ggerganov ggerganov force pushed from 4d0c0ea0 to db0cd695 88 days ago
ggerganov
ggerganov
ggerganov ggerganov force pushed from a0c0fb6e to d40f7058 87 days ago
ggerganov ggerganov force pushed from d40f7058 to d1da9927 87 days ago
github-actions github-actions added examples
ggerganov ggerganov force pushed from 1031a5d6 to 14554a82 87 days ago
ggerganov ggerganov marked this pull request as ready for review 87 days ago
ggerganov ggerganov force pushed from c095c346 to 335161d1 86 days ago
ggerganov ggerganov force pushed from 335161d1 to e1aba6af 86 days ago
github-actions github-actions added testing
github-actions github-actions added Apple Metal
ggerganov ggerganov force pushed from 5983eb1f to 9ed11a68 86 days ago
ggerganov ggerganov force pushed from 9ed11a68 to b5fea541 86 days ago
ggerganov ggerganov force pushed from b5fea541 to c4273b88 85 days ago
rgerganov
ggerganov
ggerganov ggerganov force pushed from c4273b88 to 96327b57 85 days ago
ggerganov ggerganov force pushed from 96327b57 to 36f8e20d 85 days ago
ggerganov ggerganov marked this pull request as draft 82 days ago
ggerganov ggerganov force pushed from 36f8e20d to 0e24d896 81 days ago
ggerganov ggerganov force pushed from 0e24d896 to c246784e 81 days ago
ggerganov ggerganov force pushed from c246784e to 06bb08ac 81 days ago
ggerganov ggerganov force pushed from 06bb08ac to aef19964 78 days ago
ggerganov ggerganov force pushed from aef19964 to 3d930a9e 78 days ago
ggerganov ggerganov force pushed from 82277da4 to 45341236 78 days ago
ggerganov ggerganov marked this pull request as ready for review 77 days ago
ggerganov
ggerganov ggerganov requested a review from slaren slaren 77 days ago
slaren
slaren approved these changes on 2025-07-01
github-actions github-actions added Vulkan
github-actions github-actions added SYCL
github-actions github-actions added Ascend NPU
github-actions github-actions added OpenCL
ggerganov kv-cache : use ggml_set_rows
cd811b7a
ggerganov graph : separate k and v indices
ac8f3474
ggerganov cont : remove redundant ifs
2ac5be3a
ggerganov kv-cache : improve find_slot impl
a70293bc
ggerganov kv-cache : bounds-check when accessing slot_info indices
f3da97e6
ggerganov kv-cache : add comments
5495ea96
ggerganov ggml : add TODOs for adding GGML_OP_SET_ROWS support in the backends
30b4d4e1
ggerganov ggerganov force pushed from 2f577c5a to 30b4d4e1 76 days ago
ggerganov ggerganov merged a70c8a0c into master 75 days ago
ggerganov ggerganov deleted the gg/kv-cache-use-set-rows branch 75 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone