transformers
Universal Speculative Decoding `CandidateGenerator`
#35029
Merged

Universal Speculative Decoding `CandidateGenerator` #35029

gante merged 112 commits into huggingface:main from keyboardAnt:usd
keyboardAnt
gauravjain14
gauravjain14
jmamou
jmamou
jmamou
jmamou
move `TestAssistedCandidateGeneratorDifferentTokenizers` into a new t…
aa7e01a6
refactor
f6b7f204
NOTHING. add space to rerun github actions tests
0ded37c5
remove it...
d48b69b0
`UniversalSpeculativeDecodingGenerator`
b47e33a1
Use `UniversalSpeculativeDecodingGenerator` when `generation_config.d…
8a991299
assistant tokenizes only the target's new suffix
4649bd2f
formatting
f199c94a
jmamou fix code
19c0057d
jmamou fix code
acf5a4b3
formatting
37121172
add `TestGenerateWithDifferentModels`
63f2f462
`TestGenerateWithDifferentModels` parameterize on `do_sample`
6ac33f14
`AssistantVocabMapping` & `AssistantVocabMappingCache`
69383117
formatting
5a0db3bb
`AssistantToTargetTranslator`: `get_target_input_ids` & `get_target_l…
92f8ad3b
improve `_get_assistant_to_target_input_ids` & formatting
7c8708ed
renaming
880d0aea
WIP: debugging `min_new_tokens`
d9b5e748
jmamou fix get_target_ids
25974d5e
`UniversalSpeculativeDecodingGenerator`
b8636ab1
assistant tokenizes only the target's new suffix
1ef46b78
formatting
f8e94eb1
jmamou fix code
439db843
jmamou fix code
643901de
formatting
77097ffb
`TestGenerateWithDifferentModels` parameterize on `do_sample`
d08b4f08
`AssistantVocabMapping` & `AssistantVocabMappingCache`
f242dc11
formatting
ede11763
`AssistantToTargetTranslator`: `get_target_input_ids` & `get_target_l…
511ee964
improve `_get_assistant_to_target_input_ids` & formatting
5e479456
renaming
25a43497
WIP: debugging `min_new_tokens`
95fe744a
jmamou fix get_target_ids
0ad88b27
jmamou fix device issue
bc5fa617
jmamou fix get_assistant_input_ids
41a5670e
add `TestAssistedCandidateGeneratorDifferentTokenizers`
44f7ba70
formatting
57aafcca
`AssistantVocabTranslatorCache` refactor & tests
6f95c33d
revert changes in `src/transformers/generation/logits_process.py`
078f763e
refactor `AssistedCandidateGenerator`
faac2fcd
refactor `AssistedCandidateGeneratorDifferentTokenizers`
76a2dd39
formatting
43e96e79
refactor `UniversalSpeculativeDecodingGenerator`
e63cb9df
jmamou fix negative value for max_new_tokens
8aa6020a
jmamou fix generation length target + attention_mask vs. assistant + attent
2169973d
jmamou fix device
c6da8273
jmamou fix negative max_new_tokens bug
2cf9e8e5
jmamou fix UAG
a1c0d051
jmamou minor
d8300913
formatting
19d0cced
`AssistedCandidateGeneratorDifferentTokenizers` `lookbehind`s init
5b8217d1
resolve conflict & formatting
9b0126a8
rerun CI tests
578d0b34
remove space...
7db26954
keyboardAnt keyboardAnt force pushed from a6a6b383 to 7db26954 1 year ago
remove old code
fb699001
gauravjain14
gauravjain14
jmamou
jmamou fix candidate_input_ids device
e40c775c
jmamou minor
b5ce873f
keyboardAnt
keyboardAnt Merge pull request #4 from keyboardAnt/fix_device
bfccdeaa
formatting
d34d7eaa
zucchini-nlp
zucchini-nlp commented on 2024-12-05
jmamou
jmamou Fix prepare + apply (#7)
9d4d9f9a
gauravj14 Add unittests for Universal Assisted generation
4e92e9ce
jmamou Merge branch 'main' into usd
3fe2d31e
jmamou fix style
a350b1cb
jmamou update tests
e047adf6
jmamou
gauravjain14 Remove unused import and fix `test_speculation_depth` test
011f5956
gauravjain14 exclude special and reserved tokens from tokenizer for UAG
26524900
keyboardAnt
gauravjain14 mv `test_universal_assisted_generation.py` to `generation/test_candid…
701edbb5
gauravjain14 Merge pull request #8 from keyboardAnt/unit_tests_usd
7088978f
jmamou
jmamou
gauravjain14 Remove unused imports and fix style using `make style` (#9)
3b89341d
formatting
e43dba87
gauravjain14 Swap gated `meta-llama/llama-3.2` with `allenai/llama` (#10)
a5297951
zucchini-nlp
jmamou
Merge branch 'main' into usd
9025751b
keyboardAnt keyboardAnt requested a review from zucchini-nlp zucchini-nlp 1 year ago
jmamou Fix space sign disagreement (#12)
25cd5da8
jmamou jmamou requested a review from gante gante 1 year ago
jmamou jmamou requested a review from Rocketknight1 Rocketknight1 1 year ago
jmamou jmamou requested a review from ArthurZucker ArthurZucker 1 year ago
jmamou Default values for some fields of assistant to target translator (#11)
77edae26
jmamou Update candidate_generator.py (#15)
a2a2882b
jmamou BUG fix in _prepare_assistant_input_ids (#14)
a5569471
jmamou
jmamou requested changes on 2025-01-12
typo (`target_to_assistant_input_ids`)
407d8984
formatting
a24b1934
merge upstream/main
1afdaa3a
keyboardAnt keyboardAnt requested a review from molbap molbap 1 year ago
keyboardAnt keyboardAnt requested a review from qubvel qubvel 1 year ago
keyboardAnt keyboardAnt requested a review from eustlb eustlb 1 year ago
keyboardAnt keyboardAnt requested a review from amyeroberts amyeroberts 1 year ago
keyboardAnt keyboardAnt requested a review from Cyrilvallez Cyrilvallez 1 year ago
keyboardAnt keyboardAnt requested a review from muellerzr muellerzr 1 year ago
keyboardAnt keyboardAnt requested a review from SunMarc SunMarc 1 year ago
keyboardAnt keyboardAnt requested a review from MekkCyber MekkCyber 1 year ago
keyboardAnt keyboardAnt requested a review from yonigozlan yonigozlan 1 year ago
keyboardAnt keyboardAnt requested a review from ydshieh ydshieh 1 year ago
keyboardAnt keyboardAnt requested a review from stevhliu stevhliu 1 year ago
Merge branch 'main' into usd
88f6877a
keyboardAnt keyboardAnt requested a review from jmamou jmamou 1 year ago
gante
keyboardAnt
qubvel qubvel removed review request from jmamou jmamou 1 year ago
qubvel qubvel removed review request from qubvel qubvel 1 year ago
gante gante removed review request from zucchini-nlp zucchini-nlp 1 year ago
gante gante removed review request from molbap molbap 1 year ago
gante gante removed review request from eustlb eustlb 1 year ago
gante gante removed review request from amyeroberts amyeroberts 1 year ago
gante gante removed review request from Cyrilvallez Cyrilvallez 1 year ago
gante gante removed review request from muellerzr muellerzr 1 year ago
gante gante removed review request from SunMarc SunMarc 1 year ago
gante gante removed review request from MekkCyber MekkCyber 1 year ago
gante gante removed review request from yonigozlan yonigozlan 1 year ago
gante gante removed review request from ydshieh ydshieh 1 year ago
gante gante removed review request from stevhliu stevhliu 1 year ago
gante
keyboardAnt
keyboardAnt
keyboardAnt
jmamou
zucchini-nlp
zucchini-nlp commented on 2025-01-22
jmamou
jmamou requested changes on 2025-01-23
gauravjain14 Fix minor review comments (#16)
4e3660a1
keyboardAnt Fix: `token_ids.to(torch.int64)` (#18)
c162c885
fix dtype
d0798a08
`assistant_input_ids.to(dtype=torch.long)`
d18d090b
gauravjain14 Remove unused import from test_candidate_generator.py
ae2f16ff
gante
gante commented on 2025-01-17
Merge branch 'main' of https://github.com/keyboardAnt/transformers in…
02dba319
keyboardAnt resolve pr comments (#19)
49a228f4
formatting
7f76fec3
keyboardAnt Merge branch 'main' into usd
32335a5c
jmamou Merge branch 'main' into usd
1a796471
keyboardAnt
jmamou Merge branch 'main' into usd
78a2a2cf
jmamou Fix Joao's comments (#21)
751a0994
jmamou Merge branch 'main' into usd
bfb636d3
jmamou Merge branch 'main' into usd
00e325d7
jmamou fix style (#23)
8a39f5bc
jmamou Merge branch 'main' into usd
64c95fe4
jmamou Move atm (#24)
7661fc92
jmamou fix logit_processor
503ece9c
jmamou add atm_translator test
fb7187db
jmamou refactor test
dedcf982
jmamou Merge branch 'main' into usd
7e3f3dcd
jmamou Merge branch 'main' into usd
c9fc5a61
jmamou Merge branch 'main' into usd
94e8a317
jmamou remove threading from test
4e234706
jmamou Merge branch 'main' into usd
eae175c8
gante
jmamou add require_torch in tests
67849311
jmamou Merge branch 'main' into usd
d20f07b7
jmamou
gante
gante commented on 2025-02-26
jmamou move AssistantVocabTranslatorCache + add tests
be79a153
jmamou Merge branch 'main' into usd
9cb0a3aa
gante
gante approved these changes on 2025-02-26
jmamou ruff fix
b0e7a16b
jmamou Merge branch 'main' into usd
683bbee8
jmamou
gante gante merged d18d9c32 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone