vllm
f9bf662e
- fuse parallel linears
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
25 days ago
fuse parallel linears Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
References
#38595 - [Specialized Models] Implement optimized DeepSeek V3.2 NVFP4
Author
WoosukKwon
Parents
14e2241f
Loading