transformers
Dynamic number of speculative tokens in order to accelerate speculative decoding
#33258
Merged

Dynamic number of speculative tokens in order to accelerate speculative decoding #33258

LysandreJik merged 21 commits into huggingface:main from jmamou:SL
jmamou
jmamou optimal Speculation Lookahead based on probability
1e33d372
jmamou update peer finished condition
f1d92b19
jmamou Merge branch 'huggingface:main' into SL
3a252122
jmamou add support to do_sample True
21ab0247
jmamou add stopping criteria
e7610f89
jmamou gitignore
a0b107d9
jmamou Merge branch 'main' into SL
6f15efa0
jmamou add print
adf35984
jmamou remove prints
39b9f63e
jmamou minor
bdda459c
jmamou minor
1916bcd6
jmamou git ignore
6fea2b87
jmamou Merge branch 'main' into SL
00e3e798
jmamou adding test to stopping ConfidenceCriteria
7b0103d6
jmamou doc + format
7d4a0959
jmamou add doc
1e6a0e0b
amyeroberts amyeroberts added Generation
gante
gante commented on 2024-09-05
jmamou Update .gitignore
7a005d21
jmamou update docstring and default value of assistant_confidence_threshold
201741bb
jmamou add docstring
7c90a8a5
jmamou
gante
gante approved these changes on 2024-09-10
gante gante requested a review from LysandreJik LysandreJik 1 year ago
jmamou Update src/transformers/generation/configuration_utils.py
f457553f
jmamou style fix
cd71a924
jmamou
jmamou jmamou closed this 1 year ago
jmamou
jmamou jmamou reopened this 1 year ago
LysandreJik
LysandreJik approved these changes on 2024-09-11
LysandreJik LysandreJik merged 7a51cbc6 into main 1 year ago
gante
jmamou jmamou deleted the SL branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone