vllm
e184c9c5 - [perf] Use CPU tensor to reduce GPU->CPU sync (#25884)

Commit
126 days ago
[perf] Use CPU tensor to reduce GPU->CPU sync (#25884) Signed-off-by: Lehua Ding <lehuading@tencent.com>
Author
Parents
Loading