vllm
dbebb7f8 - [Perf] Reuse workspace for FP8+FP4 Marlin MoE (#20500)

Commit
109 days ago
[Perf] Reuse workspace for FP8+FP4 Marlin MoE (#20500)

Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>