transformers
Generate: Add new decoding strategy "DoLa" in `.generate()`
#29619
Merged

gante merged 33 commits into huggingface:main from voidism:main
gante requested a review from gante 1 year ago
gante commented on 2024-03-13
gante commented on 2024-03-20
gante requested a review from amyeroberts 1 year ago
gante approved these changes on 2024-03-21
amyeroberts commented on 2024-03-21
amyeroberts commented on 2024-03-28
amyeroberts commented on 2024-05-24
amyeroberts approved these changes on 2024-05-24
voidism add support for DoLa decoding
5c358932
voidism add docs; remove deprecated function
bb66df53
voidism add test code for DoLa decoding
9769d6d3
voidism update docs and paper link
4dea208c
voidism solved the issues that made tests failed on CircleCI
1bcdf79b
voidism ruff reformatted
c31d6646
voidism update DoLa decoding; test cases for llama/mistral/mixtral/gemma; docs
8ea188b5
voidism fix formatting; fix failed test cases
76708c88
voidism fix other test cases
419be60a
voidism ruff reformatted; mamba cache issue unsolved
69a11e6d
voidism remove keyword argument 'model_inputs' to match upstream changes
87fc406c
voidism improve documentation
efe3e34c
voidism fixed suggestions from @gante
8654787b
voidism ruff reformated
7fcf9903
voidism moved config warning of dola generation from `utils.py` to `configura…
21a646a6
voidism fixed suggestions from @amyeroberts
2b81d731
voidism fixed format issue; removed print; added explanation
cc067cca
voidism remove trailing whitespace
679afeac
voidism ruff reformat to pass test
7e9367b3
voidism fixed suggestions from @amyeroberts on Mar 28
4bd54c8f
voidism fix failed CI tests
92de5cd7
voidism ruff reformatted; fixed missing argument generation_config
cff56616
voidism make `dola_layers` not optional
ba61bf47
gante fix divergence w main
87ea8d8c
gante fix dola test on mamba
57c89afd
gante rwkv test (wont fix
1c449006
gante slow tests running in fp16
9d9f8945
gante force pushed to 9d9f8945 1 year ago
gante make fixup
a8993ef5
gante remove redundant fn
aaf560f9
gante final rebase divergences
dc2192c3
gante this one was missing
520202db
gante a few more nits
ce64a5f1
gante skip stateful models
8b6653cb
gante merged d094d8d9 into main 1 year ago
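
For readers new to the strategy this PR adds, here is a rough sketch of the contrast step at the core of DoLa as I understand it from the paper: next-token log-probabilities from the final layer are compared against those from an earlier layer, and only tokens the final layer already finds plausible are kept. This is not the code merged here. The names `dola_contrast` and `relative_top` and the toy logits are assumptions for illustration, and the step that picks which earlier layer to contrast against (what `dola_layers` controls in `.generate()`) is left out.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a plain list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def dola_contrast(final_logits, early_logits, relative_top=0.1):
    """Score tokens by how much log-probability they gain between layers.

    Hypothetical helper: tokens whose final-layer probability is below
    relative_top * (max final-layer probability) are masked to -inf.
    """
    final_lp = log_softmax(final_logits)
    early_lp = log_softmax(early_logits)
    threshold = max(final_lp) + math.log(relative_top)
    return [
        f - e if f >= threshold else float("-inf")
        for f, e in zip(final_lp, early_lp)
    ]


# Toy vocabulary of three tokens (made-up numbers).
final_logits = [2.0, 1.9, -3.0]  # final layer: tokens 0 and 1 nearly tied
early_logits = [2.0, 0.5, -3.0]  # earlier layer: token 0 clearly ahead

scores = dola_contrast(final_logits, early_logits)
best = max(range(len(scores)), key=scores.__getitem__)
greedy = max(range(len(final_logits)), key=final_logits.__getitem__)
```

In this toy case plain greedy decoding on the final layer picks token 0, while the contrast favors token 1, whose probability grew the most between the two layers. Token 2 is masked because the final layer considers it implausible.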