vllm
c905684c - [Core] Asynchronous h2d in merge_multimodal_embeddings via pinned memory. (#23686)

Commit
104 days ago
[Core] Asynchronous h2d in merge_multimodal_embeddings via pinned memory. (#23686)

Signed-off-by: Chenheli Hua <huachenheli@outlook.com>
Co-authored-by: Roger Wang <hey@rogerw.io>