vllm
4a06e124
- [Perf] Batch KV cache swap copies via cuMemcpyBatchAsync (#38460)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
19 days ago
[Perf] Batch KV cache swap copies via cuMemcpyBatchAsync (#38460) Signed-off-by: Itay Etelis <itay.etelis@ibm.com> Co-authored-by: Itay Etelis <itay.etelis@ibm.com> Co-authored-by: Or Ozeri <oro@il.ibm.com>
References
#38460 - [Perf] Batch KV cache swap copies via cuMemcpyBatchAsync
Author
Etelis
Parents
3bc2734d
Loading