vllm
355be167
- fuse & parallelize linears
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
30 days ago
fuse & parallelize linears Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
References
#38595 - [Specialized Models] Implement optimized DeepSeek V3.2 NVFP4
Author
WoosukKwon
Parents
d67e21b2
Loading