vllm
24090c52
- adjust fused_q grid
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
18 days ago
adjust fused_q grid Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
References
#38595 - [Specialized Models] Implement optimized DeepSeek V3.2 NVFP4
Author
WoosukKwon
Parents
063fd29c
Loading