vllm
[Feature][Perf] Support Selective CPU Weight Offloading
#34535
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
Commits
Only offload moe weights
wzhao18
committed
139 days ago
Parameterize offloading weights
wzhao18
committed
139 days ago
format code
wzhao18
committed
139 days ago
Fix pydantic validation error
wzhao18
committed
139 days ago
Loading