vllm
f9170209 - [Perf] Optimize FusedMoEModularKernel output tensor using torch.empty (#35794)

Commit
54 days ago
[Perf] Optimize FusedMoEModularKernel output tensor using torch.empty (#35794) Signed-off-by: Xin Yang <xyangx@amazon.com>
Author
Parents
Loading