transformers
Refactor the way we handle outputs for new llamas and new models
#39120
Merged

ydshieh merged 124 commits into main from clean-llamas
ArthurZucker just update 2 files
7433c443
ArthurZucker update other models as well just making fix-copies
37b4ef02
ArthurZucker also add the changes needed to modeling utils
7f113b43
ArthurZucker put this on the pretrained model instead
abf9d39d
ArthurZucker nits and fixes
eb6747bc
ArthurZucker update generic, fix to use config value
0f1d7e0a
ArthurZucker update other modelings
e437edd7
ArthurZucker use transformers kwargs instead
96aabd77
ArthurZucker update
63df15bb
ArthurZucker update
98f402cd
ArthurZucker update other models
a7e0ce23
ArthurZucker update
c9bb39ef
ArthurZucker updates
cb5da530
ArthurZucker update
0dc08262
ArthurZucker update
fca73ad7
ArthurZucker update
98739ba4
HuggingFaceDocBuilderDev
ArthurZucker fix
124cd829
ArthurZucker finally
4a14287a
ArthurZucker very small nits
ea87eb70
ArthurZucker this fixes more tests
8c66f4d0
ArthurZucker fix other models as well!
3caf7d76
ArthurZucker marked this pull request as ready for review 357 days ago
LysandreJik approved these changes on 2025-06-30
ArthurZucker update modularqwen2
113219be
ArthurZucker update models based on qwen2
e7705c98
ArthurZucker update
a74974d9
ArthurZucker update
3fb6b710
ArthurZucker remove the **flash stuff in favor of noraml kwargs
7266aafa
vasqu approved these changes on 2025-06-30
ArthurZucker update
c7d195fe
ArthurZucker propagate gemma?
e63ef640
ArthurZucker remove output attentions
1303470a
ArthurZucker propagate
063e510d
ArthurZucker commented on 2025-06-30
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into clean…
8c96926f
ArthurZucker support cross attention edge case
01d4da85
ArthurZucker same
780141ca
ArthurZucker test this
3c0c56b8
ArthurZucker fixes
7a0512a1
ArthurZucker more fix
a13a98c6
ArthurZucker update
15a8ff4f
ArthurZucker update
22423738
ArthurZucker update
2748b993
ArthurZucker fix conflicts
da50ccc5
ArthurZucker update
209d5022
ArthurZucker fix emu3
10fb88ae
ArthurZucker fix emu3
00afce98
ArthurZucker move the fix a bit
3ac6c52f
ArthurZucker quel enfer
0b119ffb
ArthurZucker some fixes, loss_kwargs should never had been
f7a1f0da
ArthurZucker finish fixing gemma3n
6a132a07
ArthurZucker fix small lm3
9fa5f266
ArthurZucker fix another one
aaae861f
ArthurZucker fix csm now
5e5ae84a
ArthurZucker fux csm and mistral
075bd0c2
ArthurZucker fix mistral now
d04c2b1a
ArthurZucker small fixes
5065b9a2
ArthurZucker fix janusss
6a5f410d
ArthurZucker only for some models
4834aeca
ArthurZucker fixup
d8ee27e4
ArthurZucker phix phi3
e2973440
ArthurZucker more fixes?
0c9f6de0
ArthurZucker dose this fix it?
501aead2
ArthurZucker update
253307a3
ArthurZucker holy shit it was just graph breaks
a267d8d4
ArthurZucker protect torch
17cf5424
ArthurZucker updates
c4d43c53
ArthurZucker fix samhq?
4fc83fa3
ArthurZucker fix moonshine
499ae87e
ArthurZucker more moonshine fixes, 3 failures left!
b3c8641f
ArthurZucker nits
b81df9bd
ArthurZucker generic needs to support more
cfe62b6b
ArthurZucker more fixes to moonshine!
6eb5e53e
ArthurZucker fix cross attention outputs!
a9690f43
ArthurZucker fix csm!
d462a8ea
ArthurZucker nits
0f3c3683
ArthurZucker fix stupid kosmos2
3cba8ac3
ArthurZucker current updates
5af5bccd
ArthurZucker fixes
9968c85e
ArthurZucker use output recorder?
fbfaf040
ArthurZucker nicer!
1f559c67
ArthurZucker a little bit of magic
cd63172c
ArthurZucker update
cf2e98c9
ArthurZucker fix protect
c278e1cb
ArthurZucker fix
e3c82cb7
ArthurZucker small fixes
c5592be0
ArthurZucker protect import
f6190cbf
ArthurZucker fix a bunch of more models
d0be3319
ArthurZucker fix fixups
22f0eaea
ArthurZucker fix some of the last ones
422122d6
ArthurZucker nit
feba9a03
ArthurZucker partly fix phi
9a3708ae
ArthurZucker update
7a0f14a7
ArthurZucker fix import path
c4f314b3
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into clean…
c6c5efbe
ArthurZucker make something that is fullgraph compatible just to be sure
5f3722cf
ArthurZucker typing was wrong on llama so the rest was wrong as well
7781368d
ArthurZucker fucking ugly but at least it is still exportable
c9493081
ArthurZucker syle
eaa7392b
ArthurZucker supposed to fix moonshine, it still breaks
4b6a535c
ArthurZucker fix some default
9976ed8b
ArthurZucker fix the last bits of sam
6d723988
ArthurZucker update samhq
ddea6837
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into clean…
f021967a
ArthurZucker more fixes to am hq
2e296b55
ArthurZucker nit
8aaa10e2
ArthurZucker fix all output+hidden states and output_attentions!
b8d6666d
ArthurZucker fix?
cb16ef8c
ArthurZucker fix diffllama
faf2a427
ArthurZucker updates to fix initialization on the sam pips
6c83dcc7
ArthurZucker ups there was a bug
bd567297
ArthurZucker fix the last sam hq test
4213b183
ArthurZucker fix gotocr
df766048
ArthurZucker fix gotocr2!
a50382b3
ArthurZucker fixes
73d74500
ArthurZucker skip stupid tests
59ba6fab
ArthurZucker there was one left :)
e9a3e47f
ArthurZucker fixup
141a01ff
ArthurZucker fix fix copies issues with this test file
cb7a8815
ArthurZucker fix copies for sam_hq
90e36aae
ArthurZucker rm some comments
459062f0
ArthurZucker skip 2 more failing tests
2614116e
ArthurZucker fix
f5695c00
ArthurZucker fix everything
da4875af
molbap approved these changes on 2025-07-04
ArthurZucker Apply suggestions from code review
44da8484
ArthurZucker add more doc!
c209f7ed
ArthurZucker fix public init
4ae2049d
ArthurZucker fix modular qwen3
7548ec23
github-actions
qubvel commented on 2025-07-04
ydshieh merged ca7e1a37 into main 352 days ago
ydshieh deleted the clean-llamas branch 352 days ago