vllm
f03d82ef
- [UX][Bugfix] Fix OOM by setting PyTorch `max_split_size_mb` during model loading (#41268)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
22 days ago
[UX][Bugfix] Fix OOM by setting PyTorch `max_split_size_mb` during model loading (#41268) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
References
#41268 - [UX][Bugfix] Fix OOM by setting PyTorch `max_split_size_mb` during model loading
Author
MatthewBonanni
Parents
a7fb0085
Loading