transformers
Add recurrent gemma
#30143
Merged

Add recurrent gemma #30143

ArthurZucker merged 97 commits into main from add-recurrent-gemma
ArthurZucker
molbap Merge pull request #9 from huggingface/update
e536f6a3
botev Fork.
c8dfc84e
botev RecurrentGemma initial commit.
a5821870
botev Updating __init__.py.
a6ba85f7
botev Minor modification to how we initialize the cache.
0ef8cca9
botev Reformat code to 4 spaces.
102770e5
botev Fixed the forward pass.
67710b01
botev Fixed the RecurrentGemmaForCausalLM
6427da15
botev Minor comment that we might not need attention_mask and output_attent…
0b57d56c
botev Now cache should work as well.
4056277f
lberrada Adding a temporary example to check whether the model generation works.
b3358c9a
botev Adding the tests and updating imports.
91c64f9d
lberrada Adding the example file missing in the previous commit.
19a008e3
botev First working example.
6a78aeff
ArthurZucker Merge branch 'main' of github.com:huggingface/new-model-addition
ef8c0fb4
Nush395 Removing .gitignore and reverting parts of __init__.
41d21dd7
Nush395 Re-add .gitignore.
74c4b860
Nush395 Addressing comments for configuration.
02400311
Nush395 Move mask creation to `_prepare_inputs_for_generation`.
b8e4de48
botev First try at integration tests:
6085aa51
botev Transfoering between machines.
6d1f486f
botev Running normal tests.
211f06d7
botev Minor fix.
e276b26f
botev More fixes.
3887c140
botev Merge pull request #2 from botev/t1
66b3ca6e
Nush395 Addressing more comments.
b62bb8bd
botev Minor fixes.
1644ee1b
ArthurZucker first stab at cleanup
53f61e1c
ArthurZucker more refactoring
9eb77d87
ArthurZucker fix copies and else
94441f08
ArthurZucker renaming and get init to work
c4dbe980
ArthurZucker fix causal mask creation
23c54411
ArthurZucker update
b68b253a
ArthurZucker nit
951e2bd2
ArthurZucker fix a hell lot of things
e0f3ca7a
ArthurZucker updates
c272ef14
ArthurZucker update conversion script
5ec60e5c
ArthurZucker make all keys importable
8ccf919c
ArthurZucker nits
55870df0
ArthurZucker add auto mappings
3fe9f295
ArthurZucker properly convert ffw_up and down
b819bc92
ArthurZucker add scaling
ff473bf7
ArthurZucker fix generations
91e4215c
ArthurZucker for recurrent dtype
0a4bd574
ArthurZucker update
213c313d
ArthurZucker fix going beyong window
a903ab8a
ArthurZucker fixup
928c1a1c
ArthurZucker add missing files
bb4349b3
ArthurZucker current updates to remove last einops
a490f855
ArthurZucker finish modeling refactor
24abe59f
ArthurZucker TADA
0f1b1beb
ArthurZucker fix compile
5b6a2e62
ArthurZucker fix most failing testt ? ?
57cc9e87
ArthurZucker update tests
8d07eafb
ArthurZucker refactor and update
68583497
ArthurZucker update
5754cbbc
ArthurZucker Merge remote-tracking branch 'botev/add-recurrent-gemma' into recurre…
9479fbc6
ArthurZucker nits, fixup and update tests
f70554bd
ArthurZucker more fixup
3d84644f
ArthurZucker nits
6e0ccfc1
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into recur…
a73ff678
ArthurZucker fix imports
c700406e
ArthurZucker test format
3bd8ed86
ArthurZucker fixups
79527d57
ArthurZucker nits
b81fe88e
HuggingFaceDocBuilderDev
ArthurZucker tuple typing
589c2465
ArthurZucker fix code quality
7dd3b476
ArthurZucker add model card
370f7a64
ArthurZucker fix doc
65d3bdd3
ArthurZucker skip most generation tests
a360c969
ArthurZucker nits
7dfb044e
ArthurZucker style
6085f1de
ArthurZucker ArthurZucker marked this pull request as ready for review 2 years ago
ArthurZucker doc fixes
ef05cc8c
ArthurZucker ArthurZucker requested a review from amyeroberts amyeroberts 2 years ago
ArthurZucker fix pr and check_copies?
a34f905b
ArthurZucker last nit
b0d2c981
ArthurZucker oupsy
740a653a
LysandreJik
LysandreJik approved these changes on 2024-04-10
ArthurZucker Apply suggestions from code review
77963bad
ArthurZucker update
cc80f2d0
amyeroberts
amyeroberts commented on 2024-04-10
ArthurZucker Update src/transformers/models/recurrent_gemma/convert_recurrent_gemm…
308f3ead
ArthurZucker Update tests/models/recurrent_gemma/test_modeling_recurrent_gemma.py
759bebe1
ArthurZucker Update tests/models/recurrent_gemma/test_modeling_recurrent_gemma.py
4011bb04
ArthurZucker Update tests/models/recurrent_gemma/test_modeling_recurrent_gemma.py
f153601b
ArthurZucker Update tests/models/recurrent_gemma/test_modeling_recurrent_gemma.py
2505ff84
ArthurZucker update based on review
a8849967
ArthurZucker Merge branch 'add-recurrent-gemma' of github.com:huggingface/transfor…
9ad73e6c
ArthurZucker doc nit
5a2ea8de
ArthurZucker fix quality
c77723ef
ArthurZucker quality
19121e42
ArthurZucker fix slow test model path
c0af8880
ArthurZucker update default dype
e20fb42d
ArthurZucker ignore attributes that can be safely ignored in check config attributes
acf2c372
ArthurZucker 0lallalala come on
542af28c
ArthurZucker save nit
2d26d92c
ArthurZucker style
ea19cfbe
ArthurZucker remove to dict update
ec6e0065
ArthurZucker
ArthurZucker make sure we can also run in float16
9cb0c2dd
ArthurZucker style
4b5ae335
ArthurZucker ArthurZucker merged 0fe44059 into main 2 years ago
ArthurZucker ArthurZucker deleted the add-recurrent-gemma branch 2 years ago
proger
proger commented on 2024-04-10

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone