vllm
f9170209
- [Perf] Optimize FusedMoEModularKernel output tensor using torch.empty (#35794)
Commit
54 days ago
[Perf] Optimize FusedMoEModularKernel output tensor using torch.empty (#35794)

Signed-off-by: Xin Yang <xyangx@amazon.com>
References
#35794 - [Perf] Optimize FusedMoEModularKernel output tensor using torch.empty
Author
xyang16
Parents
86483ca7