vllm
03b5f940
- [V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync (#29723)
Commit
25 days ago
[V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync (#29723)

Signed-off-by: dongbo910220 <1275604947@qq.com>
References
#29723 - [V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync
Author
dongbo910220
Parents
2e7054da