vllm
4a06e124 - [Perf] Batch KV cache swap copies via cuMemcpyBatchAsync (#38460)

Commit
19 days ago
[Perf] Batch KV cache swap copies via cuMemcpyBatchAsync (#38460) Signed-off-by: Itay Etelis <itay.etelis@ibm.com> Co-authored-by: Itay Etelis <itay.etelis@ibm.com> Co-authored-by: Or Ozeri <oro@il.ibm.com>
Author
Parents
Loading