vllm
[Perf] Reuse workspace for FP8+FP4 Marlin MoE
#20500
Merged

mgoin Reuse workspace for FP8+FP4 Marlin MoE
534270a2
github-actions
gemini-code-assist commented on 2025-07-04
gemini-code-assist commented on 2025-07-04
mgoin Merge branch 'main' into reuse-workspace-marlin-moe-fp8+fp4
e79bb05a
mgoin Merge branch 'main' into reuse-workspace-marlin-moe-fp8+fp4
91a288cd
mgoin added the ready label
mgoin marked this pull request as ready for review 116 days ago
mgoin requested a review from robertgshaw2-redhat 116 days ago
mgoin requested a review from tlrmchlsmth 116 days ago
mgoin requested a review from yewentao256 116 days ago
yewentao256 approved these changes on 2025-09-16
yewentao256 Merge branch 'main' into reuse-workspace-marlin-moe-fp8+fp4
4da52ce5
mgoin merged dbebb7f8 into main 115 days ago
mgoin deleted the reuse-workspace-marlin-moe-fp8+fp4 branch 115 days ago
