vllm
59d53066
- [Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993)
Commit
74 days ago
[Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993)

Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
References
#32993 - [Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation
Author
wzhao18
Parents
4a9952ec