transformers
[core] Fix torchao loading
#42244
Open

MekkCyber wants to merge 290 commits into main from fix-torchao
ArthurZucker cleanup
b4ef14c2
ArthurZucker fixes
b6027426
ArthurZucker small upstead
a6934175
ArthurZucker i was just missing a "clone" :)
b82c4f25
ArthurZucker kill poool asap
c9417f98
ArthurZucker nits
58fc7b57
ArthurZucker ruff
b01dd4fd
ArthurZucker fix modular
7b64815c
ArthurZucker fix-copies
66713331
ArthurZucker quantization works
fe220cf1
ArthurZucker fixes
c6bb839d
ArthurZucker updates
2fe87ce1
ArthurZucker updates
466df965
ArthurZucker update
6f6deb0f
ArthurZucker fix fp8, it now works
0519e21d
ArthurZucker fix-copies
7efb487d
ArthurZucker nits
62ccfd9b
ArthurZucker support tp dtensor
8e74adc4
ArthurZucker local changes
a5859af4
ArthurZucker fix tie weight embeddding?
c3f54372
ArthurZucker fix auto for mps
a8998de3
ArthurZucker current updates
9735c6e0
ArthurZucker small update
965b0066
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ec49d733
ArthurZucker Youhou
a92cb1fe
ArthurZucker fix fp8
653933c2
ArthurZucker TP + QUANTIZE now works
ac1af432
ArthurZucker the way to make local tensor + Dtensor work
aa0ebbec
ArthurZucker nit
e1eb5a4a
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
de097799
ArthurZucker move progress
edeacc38
ArthurZucker fix llama tests ?
f1312dc9
ArthurZucker smoll QOL
c53755fc
ArthurZucker ship most fixes
22145750
ArthurZucker fix bunch of tests
3cde7b06
ArthurZucker fix copies
17f25f9f
ArthurZucker styling
134959c1
ArthurZucker yups
0402e564
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into refac…
44436589
ArthurZucker small updates
6c9fda4e
ArthurZucker add qwen2_moe to the mapping!
28a1d225
ArthurZucker nit
8cf96946
ArthurZucker small nits
a01ad8d6
ArthurZucker update
9f615bcc
ArthurZucker up
fe9b0478
ArthurZucker fix olmoe
d9bb0e34
ArthurZucker fix ernie
50a85efd
ArthurZucker more fixups
9bed4886
ArthurZucker updates
912dd2f7
ArthurZucker revert small granite moe stuff
48c85c78
ArthurZucker yups
00e36042
ArthurZucker update conversion mapping!
edf96f84
ArthurZucker licence
c3c534fe
ArthurZucker smal nit
63093470
ArthurZucker update
b320474e
ArthurZucker up
5d4d27e6
ArthurZucker Apply suggestion from @LysandreJik
00846a2e
ArthurZucker updates based on review
f4775fca
ArthurZucker better error handling (Am I too rust-y) ?
e0fd1e42
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d34482c6
ArthurZucker Apply suggestion from @LysandreJik
904283dd
ArthurZucker Apply suggestion from @LysandreJik
b225885f
ArthurZucker small nits
7f196f93
ArthurZucker fix tie weight keys?
6d0aa663
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ef5123b8
ArthurZucker nit
9f5ec4ac
ArthurZucker fix glob import
2d84aba1
ArthurZucker fix import and error
573af759
ArthurZucker up
e848ab61
ArthurZucker update
1d4411aa
ArthurZucker up
3e4d8ea9
ArthurZucker up
07e265d1
ArthurZucker did not know glob was only 3.13
913171a9
ArthurZucker fak
e465bc0a
ArthurZucker how many tests does this fix?
19f94d0f
ArthurZucker cleanup
29e017d5
ArthurZucker qol + nits
70619569
ArthurZucker fixup
0ebb1b62
ArthurZucker nit
6b398e14
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
e59b1fff
ArthurZucker merge
52d85e0f
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into refac…
29aa0515
ArthurZucker small updates?
20b6142a
ArthurZucker cleanup what is no longer used
a79de848
ArthurZucker nits
606452d6
ArthurZucker dtype
7eda8aa7
ArthurZucker up
b148577e
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
9022bc29
ArthurZucker upsates
0da6e927
ArthurZucker qol
9cb0432c
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
4d34cedf
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into refac…
c515eb6d
ArthurZucker fix triton import error
85973fc9
ArthurZucker fixup
9b6a7a44
ArthurZucker lol so much time lost on this shit
3baf4b7f
ArthurZucker nits
82a35bcc
ArthurZucker fix the init of param
6c88206d
ArthurZucker ah actually we don't discard lm head if missing -> needs to be moved …
4d797099
ArthurZucker fix some tests
d1e84db3
ArthurZucker small fixes
f2938df8
ArthurZucker up
22fcdaf9
ArthurZucker up
7d78aa1b
ArthurZucker dik why we tie weights twice but,..,,.
80517f53
ArthurZucker ups
2ff85326
ArthurZucker removeunused
d923061e
ArthurZucker fix hunyuan
ce8c1c19
ArthurZucker small fix
23e3ed74
ArthurZucker nits
a8fb5540
ArthurZucker ish
ab6ee8ae
ArthurZucker up
77ccbb17
ArthurZucker rev
8a8beff7
ArthurZucker fix more tie weights keys
02386ce7
ArthurZucker small fixes
1c87945a
ArthurZucker nit
00b95ee0
ArthurZucker update
a170f290
ArthurZucker fix and fix
8b924a3b
ArthurZucker fix a test
8f7b1d02
ArthurZucker glubs
93862177
ArthurZucker current shitty changes
4894a257
ArthurZucker ship validated ones
da7dc100
ArthurZucker more
d7c81717
ArthurZucker more update
e0884089
ArthurZucker more
4f212de4
ArthurZucker more
dc5a22c2
ArthurZucker more
675b2bca
ArthurZucker mllama
f85f2397
ArthurZucker more up
76b6a92d
ArthurZucker fix ernie
ba1a8b64
ArthurZucker fix xopies
ba3de5ad
ArthurZucker up more
8fd255c7
ArthurZucker more fixes
5d7507b1
ArthurZucker up
0fb23403
ArthurZucker up
32b92738
ArthurZucker fix-copies
0b95826c
ArthurZucker fix more
5794d27d
ArthurZucker more updates
5e71bd4a
ArthurZucker AI UPDATE
20d1b340
ArthurZucker up
89846e7d
ArthurZucker hoey
a581fd75
Cyrilvallez make it fast
1652c9c5
Cyrilvallez fix
dcad7030
ArthurZucker lol
c921cede
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
50714d8c
ArthurZucker fix asjusting
8936cc40
ArthurZucker more fixes
5c54332e
ArthurZucker _dtype nit
ff108789
ArthurZucker up
9601b82c
ArthurZucker nit
db02b9d7
ArthurZucker update
42fd4c43
ArthurZucker update
45271710
Cyrilvallez remove semaphores
bd362112
Cyrilvallez fix import to avoid jit execution
e2aefee7
ArthurZucker try to remove custom tiing logic when its stupid
74a0e9c7
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ead2ac37
ArthurZucker fix more individual models
e7165da0
ArthurZucker fix whisper as well
2ff765e9
ArthurZucker fix?
912562c0
ArthurZucker fox umt5
c43495a5
Cyrilvallez improve tqdm bar
57988f25
Cyrilvallez cleanup a bit
8c16de16
Cyrilvallez oupsi
b8927d67
ArthurZucker some updates
2733ff69
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
8baa3fe9
Cyrilvallez improve
d91701f7
Cyrilvallez Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
5146dec4
Cyrilvallez remove all buffering -> much faster without it
acc5b245
ArthurZucker remove some tie_weights custome funcs when not needed
58389a1f
ArthurZucker more fixes related to strict matching regex
92c0229a
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d9e7fe65
ArthurZucker remove ALL custom tie weights
b57d7897
ArthurZucker small update
ef8b6c35
Cyrilvallez revert change to init scheme (no need for params)
a228fd0a
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
07574ddd
SunMarc fix
710b1fff
Cyrilvallez mixtral init
2526cc5d
ArthurZucker try less strict source check
6cb37940
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
e4cadfb1
Cyrilvallez tied weight first shot to the fiiiixxxxxx
3fea8658
ArthurZucker does this help?
82f94b8a
ArthurZucker :)
84dd6eb2
ArthurZucker fix some ppolry defined tied_weights_keys for now
cc081954
ArthurZucker fixes for more models torch_bc
f72f96d4
ArthurZucker nits and fixes
e3415292
ArthurZucker last update
0e51decd
ArthurZucker Revert "tied weight first shot to the fiiiixxxxxx"
0f022b59
ArthurZucker here we go again
1dabb4c3
ArthurZucker an attempt
0c2b667d
ArthurZucker up?
c48e1edb
ArthurZucker nits
d2236356
SunMarc Fix bnb loading !
bdbc01a6
SunMarc rm print
399388d1
SunMarc Merge branch 'refactor-weight-loading' into fix-bnb
acbeeae7
ArthurZucker subclass nn.Parameters
f692f4bd
ArthurZucker up
2fa058fe
ArthurZucker lol
78d46227
ArthurZucker Ouiiii
8ff4ad56
ArthurZucker fix led
32226787
ArthurZucker fix long cat flash
9a76a6ee
ArthurZucker fix qwen and long cat flash
9fde9f78
ArthurZucker properly fix qwen init
074a449f
ArthurZucker just push this for now
dde5500d
ArthurZucker propnet is dumb
0e7d2d05
ArthurZucker update
18b02eea
SunMarc rm import
e16da231
SunMarc update
386e259b
ArthurZucker push
9c0db728
SunMarc Merge remote-tracking branch 'upstream/refactor-weight-loading' into …
9788014a
SunMarc Update src/transformers/core_model_loading.py
72eff97c
ArthurZucker remove explict sharing of some tied keys.
75d3afcb
ArthurZucker update decoder.bias
85ab0859
ArthurZucker moe case
443573ae
SunMarc Fix loadedparam
d841a04b
SunMarc Merge remote-tracking branch 'upstream/fix-bnb' into fix-bnb
e235eedd
SunMarc rm report
e4df7526
ArthurZucker more changes to untangle old hardcoded ting
f8f09734
ArthurZucker fixup
5c9d56cb
ArthurZucker Merge branch 'main' into refactor-weight-loading
a0029f20
ArthurZucker fix big faileurs
44943fb8
SunMarc Fix tests single gpu
3e696222
SunMarc should fix it
a0525133
ArthurZucker fix prophnet
76d66be5
ArthurZucker Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d176b489
ArthurZucker fix resize token embeddings
3ffc59ef
ArthurZucker nits
2a00e493
ArthurZucker fix xcodex
f7d0183d
ArthurZucker asyncio?
bbf5b000
ArthurZucker fix smart apply
04128324
ArthurZucker fix data-2-vec
c137ea33
ArthurZucker [build-ci-image]
7b7c9903
ArthurZucker checkout
de74aebb
ArthurZucker uupdate
94a53d4c
SunMarc Merge branch 'refactor-weight-loading' into fix-bnb
db4fe31d
ArthurZucker fix hunyuan
8755a4be
ArthurZucker update error message
5be67b96
ArthurZucker fix deformable detr
86a4e516
ArthurZucker fixes
09bcd2ee
ArthurZucker fix init weights for non param gate up projs
7b457fd0
ArthurZucker shared todo?
e033947a
SunMarc guard needed for compressed-tensors
9fa1b7a2
SunMarc Merge branch 'refactor-weight-loading' into fix-bnb
ea5822db
SunMarc deal with buffers
5881d8eb
ArthurZucker update some models
f93f3570
ArthurZucker big revert, don't break this behaviour
2f0a6aed
ArthurZucker ty @SunMarc this fixes the buffers
3c8c7572
ArthurZucker mt5 fuck
f5a7c33d
SunMarc Merge branch 'refactor-weight-loading' into fix-bnb
36514602
SunMarc Merge branch 'refactor-weight-loading' into fix-bnb
00b00448
SunMarc fix
7d8df526
MekkCyber first
01036272
MekkCyber don't initialize
055aef86

Reviewers
No reviews
Assignees
No one assigned