vllm
b58669cb
- [Perf][Spec Decode] Avoid per-step numpy allocation in prepare_next_t… (#41043)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
40 days ago
[Perf][Spec Decode] Avoid per-step numpy allocation in prepare_next_t… (#41043) Signed-off-by: wangluochao902 <wangluochao902@gmail.com> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
References
#41043 - [Perf][Spec Decode] Avoid per-step numpy allocation in prepare_next_t…
Author
wangluochao902
Parents
1628239e
Loading