transformers
Fix `_speculative_sampling` implementation
#28508
Merged

Fix `_speculative_sampling` implementation #28508

ofirzaf
ofirzaf Fix _speculative_sampling implementation
0c5a1461
ofirzaf Fix wrong clamping in rejection sampling
d5224cf7
gante
gante approved these changes on 2024-01-15
gante gante requested a review from amyeroberts amyeroberts 2 years ago
amyeroberts
amyeroberts commented on 2024-01-15
ofirzaf Improve readability of _speculative_sampling
2725e160
ofirzaf
gante
ofirzaf @ofirzaf
190a63a0
ofirzaf ofirzaf force pushed to 190a63a0 2 years ago
danielkorat
amyeroberts
amyeroberts commented on 2024-01-17
ofirzaf
ofirzaf
danielkorat
gante
gante
ofirzaf Clarify speculative behavior on acceptance of EOS
48b26852
ofirzaf Avert candidate generation if max_new_tokens == 0
3b9e4063
ofirzaf Add test for _speculative_sampling
a55181f0
ofirzaf
danielkorat
gante
gante
gante approved these changes on 2024-01-19
danielkorat
gante
amyeroberts
amyeroberts approved these changes on 2024-01-19
gante gante merged 9efec114 into main 2 years ago
ofirzaf ofirzaf deleted the fix-speculative-decoding-algo branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone