vllm
[Feature][Perf] Support Selective CPU Weight Offloading
#34535
Merged

Commits
  • Only offload moe weights
    wzhao18 committed 139 days ago
  • Parameterize offloading weights
    wzhao18 committed 139 days ago
  • format code
    wzhao18 committed 139 days ago
  • Fix pydantic validation error
    wzhao18 committed 139 days ago