vllm
03b5f940 - [V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync (#29723)

Commit
25 days ago
[V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync (#29723)

Signed-off-by: dongbo910220 <1275604947@qq.com>