vllm
e184c9c5
- [perf] Use CPU tensor to reduce GPU->CPU sync (#25884)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
126 days ago
[perf] Use CPU tensor to reduce GPU->CPU sync (#25884) Signed-off-by: Lehua Ding <lehuading@tencent.com>
References
#25884 - [perf] Use CPU tensor to reduce GPU->CPU sync
Author
lhtin
Parents
d7e34b42
Loading