vllm
b443e670
- fuse mla cache
Commit
20 days ago
fuse mla cache

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
References
#38595 - [Specialized Models] Implement optimized DeepSeek V3.2 NVFP4
Author
WoosukKwon
Parents
34d73a33