vllm
c027541e
- [Hybrid] Enable spec decoding in mamba cache align mode (#33705)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
8 days ago
[Hybrid] Enable spec decoding in mamba cache align mode (#33705) Signed-off-by: huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
References
#33705 - [Hybrid] Enable spec decoding in mamba cache align mode
Author
peakcrosser7
Parents
fd267bc7
Loading