openvino
3797c559
- [NPUW] Share kvcache between prefill and generate when chunking is enabled (#32642)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
174 days ago
[NPUW] Share kvcache between prefill and generate when chunking is enabled (#32642) Depends on lazy I/O https://github.com/openvinotoolkit/openvino/pull/32277 Sharing kvcache taken from https://github.com/dmatveev/openvino/pull/19 (kudos to Xiong)
References
#32642 - [NPUW] Share kvcache between prefill and generate when chunking is enabled
Author
smirnov-alexey
Parents
1687ab91
Loading