vllm
59d53066 - [Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993)

Commit
74 days ago
[Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993) Signed-off-by: wzhao18 <wzhao18.sz@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
Author
Parents
Loading