vllm
[Feature][Perf] Support Selective CPU Weight Offloading
#34535
Merged

[Feature][Perf] Support Selective CPU Weight Offloading #34535

wzhao18
wzhao18 wzhao18 requested a review from heheda12345 heheda12345 85 days ago
mergify mergify added v1
wzhao18 Only offload moe weights
8240011a
wzhao18 Parameterize offloading weights
06f1ba14
wzhao18 format code
fe927de4
wzhao18 wzhao18 force pushed to fe927de4 85 days ago
gemini-code-assist
gemini-code-assist commented on 2026-02-13
mgoin
mgoin approved these changes on 2026-02-13
mgoin mgoin added ready
mgoin mgoin added nvidia
wzhao18 Fix pydantic validation error
8435c8bc
vllm-bot vllm-bot merged b37b6797 into main 85 days ago
ehfd
wzhao18
wzhao18
ehfd

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone