vllm
fix(ngram): match async ngram_gpu acceptance rate to CPU
#44056
Open


shiyangyang2001-lgtm
shiyangyang2001-lgtm requested a review from njhill 5 days ago
shiyangyang2001-lgtm requested a review from benchislett 5 days ago
shiyangyang2001-lgtm requested a review from luccafong 5 days ago
shiyangyang2001-lgtm requested a review from MatthewBonanni 5 days ago
shiyangyang2001-lgtm requested a review from WoosukKwon 5 days ago
shiyangyang2001-lgtm requested a review from robertgshaw2-redhat 5 days ago
shiyangyang2001-lgtm requested a review from ywang96 5 days ago
shiyangyang2001-lgtm requested a review from alexm-redhat 5 days ago
shiyangyang2001-lgtm requested a review from heheda12345 5 days ago
shiyangyang2001-lgtm requested a review from ApostaC 5 days ago
shiyangyang2001-lgtm requested a review from orozery 5 days ago
github-actions
mergify added speculative-decoding
mergify added v1
shiyangyang2001-lgtm changed the title from "[ngram][async] Fix ngram_gpu acceptance rate to match CPU ngram" to "fix(ngram): match async ngram_gpu acceptance rate to CPU" 3 days ago
shiyangyang2001-lgtm [spec decode] Pass ngram-trimmed invalid tokens to scheduler stats
9d284c3a
shiyangyang2001-lgtm force pushed from 08e5c0a1 to 9d284c3a 3 days ago

