openvino
[NPUW] Share kvcache between prefill and generate when chunking is enabled
#32642
Merged

[NPUW] Share kvcache between prefill and generate when chunking is enabled #32642

smirnov-alexey
smirnov-alexey Introduce lazy memory allocation for ireq's I/O
fcceeb72
smirnov-alexey Fix no tensor being present in the storage
f01b8d58
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
f6fbf64d
smirnov-alexey Address review comments
bd27bbc7
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
b258ed06
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
bd03f343
smirnov-alexey Copy Xiong's changes
7dab40d1
smirnov-alexey smirnov-alexey requested a review from dmatveev dmatveev 202 days ago
smirnov-alexey smirnov-alexey requested a review from intelgaoxiong intelgaoxiong 202 days ago
smirnov-alexey smirnov-alexey assigned dmatveev dmatveev 202 days ago
smirnov-alexey smirnov-alexey requested a review 202 days ago
smirnov-alexey smirnov-alexey requested a review 202 days ago
smirnov-alexey smirnov-alexey added do_not_review
github-actions github-actions added category: NPU
github-actions github-actions added category: NPUW
smirnov-alexey
smirnov-alexey commented on 2025-10-31
dmatveev dmatveev added this to the 2026.0 milestone 202 days ago
smirnov-alexey Remove copy
20af381a
intelgaoxiong
intelgaoxiong commented on 2025-11-04
smirnov-alexey WIP
a9e2029c
smirnov-alexey Fix concurrency issue with iterator invalidation
a54d529c
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
ec23004b
smirnov-alexey Refactoring
a09921fa
smirnov-alexey Fix merge
8146143e
smirnov-alexey Protect get_tensor by mutex
42b7c1d4
smirnov-alexey Merge branch 'as/npuw_lazy_io_alloc' of https://github.com/smirnov-al…
cf01cf83
smirnov-alexey Disable kv cache sharing when one of the models is transposed
430af31b
intelgaoxiong
intelgaoxiong commented on 2025-11-06
smirnov-alexey Fix strides
4ab490b8
esmirno
esmirno approved these changes on 2025-11-07
intelgaoxiong
smirnov-alexey Handle strided tensors - copy on host
05895d99
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
5ececfc6
smirnov-alexey
smirnov-alexey Handle strided tensors in pyramid attention
2183e6a9
smirnov-alexey
intelgaoxiong
intelgaoxiong approved these changes on 2025-11-10
smirnov-alexey
smirnov-alexey commented on 2025-11-11
smirnov-alexey
smirnov-alexey commented on 2025-11-11
smirnov-alexey
smirnov-alexey commented on 2025-11-11
smirnov-alexey
smirnov-alexey commented on 2025-11-11
smirnov-alexey
smirnov-alexey commented on 2025-11-11
smirnov-alexey Address review comments
ea2b7a06
smirnov-alexey Merge branch 'as/npuw_lazy_io_alloc' of https://github.com/smirnov-al…
c2d6cb7a
smirnov-alexey Address review comments
afa41468
smirnov-alexey Merge branch 'master' into as/npuw_lazy_io_alloc
3c1120d1
smirnov-alexey Merge branch 'master' into as/npuw_share_kvcache
a76cea0e
smirnov-alexey Fix shape
db789400
dmatveev dmatveev removed do_not_review
dmatveev dmatveev added do not merge
smirnov-alexey Merge branch 'master' of https://github.com/openvinotoolkit/openvino …
0b84f331
smirnov-alexey Remove is_io()
cace85bf
smirnov-alexey Merge branch 'as/npuw_lazy_io_alloc' of https://github.com/smirnov-al…
25262866
smirnov-alexey Move kv cache copy to second token time
9088de64
smirnov-alexey Fix empty tensors for pyramid attention
ef2d3659
smirnov-alexey Merge branch 'as/npuw_share_kvcache' of https://github.com/smirnov-al…
494672af
intelgaoxiong
smirnov-alexey Merge branch 'master' into as/npuw_share_kvcache
a69af6b1
smirnov-alexey
smirnov-alexey commented on 2025-11-27
smirnov-alexey Align code with latest pyramid changes merged
f7d75f4f
dmatveev dmatveev removed do not merge
dmatveev
dmatveev approved these changes on 2025-11-27
dmatveev dmatveev merged 3797c559 into master 175 days ago
dmatveev dmatveev deleted the as/npuw_share_kvcache branch 175 days ago

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone