vllm
b58669cb - [Perf][Spec Decode] Avoid per-step numpy allocation in prepare_next_t… (#41043)

Commit
40 days ago
[Perf][Spec Decode] Avoid per-step numpy allocation in prepare_next_t… (#41043)

Signed-off-by: wangluochao902 <wangluochao902@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>