[V1] [Spec Decode] Support random sampling for spec decode #13933
change rejection sampler api
1f69c5a2
change rejection sampler api
3399a00f
fix
24acbb43
minor
18a18153
update rejection sampler tests
269e48b2
minor
0e0bf0ae
minor
b1f6228f
fix tests
a2c830ac
rejection sampler in model runner
715f7b18
minor
4b89ffc7
add comments
3950b5c5
runnable, need to check correctness
e53fa249
minor
f4b6d385
minor
2292d596
merge
e6b49146
pass basic correctness
ec459626
pass all tests
d706b88a
minor
923b980d
LiuXiaoxuanPKU
marked this pull request as ready for review 1 year ago
minor
7c867fe6
fix tests
aece5d9a
fix device
a109cbb3
optimize uniform sample
b8a31b3b
fix tests
40253f19
style
8c4c8b20
LiuXiaoxuanPKU
changed the title from "[V1] [Spec Decode] Support random sampling in Rejection Sampler" to "[V1] [Spec Decode] Support random sampling for spec decode" 1 year ago
runnable but need to check correctness
1118c437
basic correctness
c7ecc351
merge
be0bece3
skip topp topk
2235d58c
clean up naming and v1 tests
5e3f16aa
separate v0, v1 tests
0cd7654e
fix tests
3d93c7d9
minor
1bcc9c02
docstr
e65ae19e
fix comments;
124db240
fix merge
cd24f5ec
Update vllm/v1/sample/rejection_sampler.py
381de2db
Update vllm/v1/sample/rejection_sampler.py
a52d6050
Update vllm/v1/sample/rejection_sampler.py
f989e1e7
Update vllm/v1/spec_decode/utils.py
f2c11fdb
Merge branch 'main' into random-sampling
94243bf7
fix comments
4fd945b4
more fix
8875d1ae
fix comments
e7c58056
change rejection sampling output
8fff235a