vllm
dbebb7f8
- [Perf] Reuse workspace for FP8+FP4 Marlin MoE (#20500)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
109 days ago
[Perf] Reuse workspace for FP8+FP4 Marlin MoE (#20500) Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
References
#20500 - [Perf] Reuse workspace for FP8+FP4 Marlin MoE
Author
mgoin
Parents
3053a22b
Loading