[core] Fix torchao loading #42244
cleanup
b4ef14c2
fixes
b6027426
small upstead
a6934175
i was just missing a "clone" :)
b82c4f25
kill pool asap
c9417f98
nits
58fc7b57
ruff
b01dd4fd
fix modular
7b64815c
fix-copies
66713331
quantization works
fe220cf1
fixes
c6bb839d
updates
2fe87ce1
updates
466df965
update
6f6deb0f
fix fp8, it now works
0519e21d
fix-copies
7efb487d
nits
62ccfd9b
support tp dtensor
8e74adc4
local changes
a5859af4
fix tie weight embedding?
c3f54372
fix auto for mps
a8998de3
current updates
9735c6e0
small update
965b0066
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ec49d733
Youhou
a92cb1fe
fix fp8
653933c2
TP + QUANTIZE now works
ac1af432
the way to make local tensor + Dtensor work
aa0ebbec
nit
e1eb5a4a
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
de097799
move progress
edeacc38
fix llama tests ?
f1312dc9
smoll QOL
c53755fc
ship most fixes
22145750
fix bunch of tests
3cde7b06
fix copies
17f25f9f
styling
134959c1
yups
0402e564
Merge branch 'main' of github.com:huggingface/transformers into refac…
44436589
small updates
6c9fda4e
add qwen2_moe to the mapping!
28a1d225
nit
8cf96946
small nits
a01ad8d6
update
9f615bcc
up
fe9b0478
fix olmoe
d9bb0e34
fix ernie
50a85efd
more fixups
9bed4886
updates
912dd2f7
revert small granite moe stuff
48c85c78
yups
00e36042
update conversion mapping!
edf96f84
licence
c3c534fe
small nit
63093470
update
b320474e
up
5d4d27e6
Apply suggestion from @LysandreJik
00846a2e
updates based on review
f4775fca
better error handling (Am I too rust-y) ?
e0fd1e42
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d34482c6
Apply suggestion from @LysandreJik
904283dd
Apply suggestion from @LysandreJik
b225885f
small nits
7f196f93
fix tie weight keys?
6d0aa663
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ef5123b8
nit
9f5ec4ac
fix glob import
2d84aba1
fix import and error
573af759
up
e848ab61
update
1d4411aa
up
3e4d8ea9
up
07e265d1
did not know glob was only 3.13
913171a9
fak
e465bc0a
how many tests does this fix?
19f94d0f
cleanup
29e017d5
qol + nits
70619569
fixup
0ebb1b62
nit
6b398e14
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
e59b1fff
merge
52d85e0f
Merge branch 'main' of github.com:huggingface/transformers into refac…
29aa0515
small updates?
20b6142a
cleanup what is no longer used
a79de848
nits
606452d6
dtype
7eda8aa7
up
b148577e
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
9022bc29
updates
0da6e927
qol
9cb0432c
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
4d34cedf
Merge branch 'main' of github.com:huggingface/transformers into refac…
c515eb6d
fix triton import error
85973fc9
fixup
9b6a7a44
lol so much time lost on this shit
3baf4b7f
nits
82a35bcc
fix the init of param
6c88206d
ah actually we don't discard lm head if missing -> needs to be moved …
4d797099
fix some tests
d1e84db3
small fixes
f2938df8
up
22fcdaf9
up
7d78aa1b
idk why we tie weights twice but...
80517f53
ups
2ff85326
remove unused
d923061e
fix hunyuan
ce8c1c19
small fix
23e3ed74
nits
a8fb5540
ish
ab6ee8ae
up
77ccbb17
rev
8a8beff7
fix more tie weights keys
02386ce7
small fixes
1c87945a
nit
00b95ee0
update
a170f290
fix and fix
8b924a3b
fix a test
8f7b1d02
glubs
93862177
current shitty changes
4894a257
ship validated ones
da7dc100
more
d7c81717
more update
e0884089
more
4f212de4
more
dc5a22c2
more
675b2bca
mllama
f85f2397
more up
76b6a92d
fix ernie
ba1a8b64
fix copies
ba3de5ad
up more
8fd255c7
more fixes
5d7507b1
up
0fb23403
up
32b92738
fix-copies
0b95826c
fix more
5794d27d
more updates
5e71bd4a
AI UPDATE
20d1b340
up
89846e7d
hoey
a581fd75
make it fast
1652c9c5
fix
dcad7030
lol
c921cede
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
50714d8c
fix adjusting
8936cc40
more fixes
5c54332e
_dtype nit
ff108789
up
9601b82c
nit
db02b9d7
update
42fd4c43
update
45271710
remove semaphores
bd362112
fix import to avoid jit execution
e2aefee7
try to remove custom tying logic when it's stupid
74a0e9c7
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ead2ac37
fix more individual models
e7165da0
fix whisper as well
2ff765e9
fix?
912562c0
fix umt5
c43495a5
improve tqdm bar
57988f25
cleanup a bit
8c16de16
oupsi
b8927d67
some updates
2733ff69
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
8baa3fe9
improve
d91701f7
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
5146dec4
remove all buffering -> much faster without it
acc5b245
remove some tie_weights custom funcs when not needed
58389a1f
more fixes related to strict matching regex
92c0229a
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d9e7fe65
remove ALL custom tie weights
b57d7897
small update
ef8b6c35
revert change to init scheme (no need for params)
a228fd0a
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
07574ddd
fix
710b1fff
mixtral init
2526cc5d
try less strict source check
6cb37940
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
e4cadfb1
tied weight first shot to the fiiiixxxxxx
3fea8658
does this help?
82f94b8a
:)
84dd6eb2
fix some poorly defined tied_weights_keys for now
cc081954
fixes for more models torch_bc
f72f96d4
nits and fixes
e3415292
last update
0e51decd
Revert "tied weight first shot to the fiiiixxxxxx"
0f022b59
here we go again
1dabb4c3
an attempt
0c2b667d
up?
c48e1edb
nits
d2236356
Fix bnb loading !
bdbc01a6
rm print
399388d1
Merge branch 'refactor-weight-loading' into fix-bnb
acbeeae7
subclass nn.Parameters
f692f4bd
up
2fa058fe
lol
78d46227
Ouiiii
8ff4ad56
fix led
32226787
fix long cat flash
9a76a6ee
fix qwen and long cat flash
9fde9f78
properly fix qwen init
074a449f
just push this for now
dde5500d
propnet is dumb
0e7d2d05
update
18b02eea
rm import
e16da231
update
386e259b
push
9c0db728
Merge remote-tracking branch 'upstream/refactor-weight-loading' into …
9788014a
Update src/transformers/core_model_loading.py
72eff97c
remove explicit sharing of some tied keys.
75d3afcb
update decoder.bias
85ab0859
moe case
443573ae
Fix loadedparam
d841a04b
Merge remote-tracking branch 'upstream/fix-bnb' into fix-bnb
e235eedd
rm report
e4df7526
more changes to untangle old hardcoded ting
f8f09734
fixup
5c9d56cb
Merge branch 'main' into refactor-weight-loading
a0029f20
fix big failures
44943fb8
Fix tests single gpu
3e696222
should fix it
a0525133
fix prophetnet
76d66be5
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d176b489
fix resize token embeddings
3ffc59ef
nits
2a00e493
fix xcodex
f7d0183d
asyncio?
bbf5b000
fix smart apply
04128324
fix data-2-vec
c137ea33
[build-ci-image]
7b7c9903
checkout
de74aebb
update
94a53d4c
Merge branch 'refactor-weight-loading' into fix-bnb
db4fe31d
fix hunyuan
8755a4be
update error message
5be67b96
fix deformable detr
86a4e516
fixes
09bcd2ee
fix init weights for non param gate up projs
7b457fd0
shared todo?
e033947a
guard needed for compressed-tensors
9fa1b7a2
Merge branch 'refactor-weight-loading' into fix-bnb
ea5822db
deal with buffers
5881d8eb
update some models
f93f3570
big revert, don't break this behaviour
2f0a6aed
ty @SunMarc this fixes the buffers
3c8c7572
mt5 fuck
f5a7c33d
Merge branch 'refactor-weight-loading' into fix-bnb
36514602
Merge branch 'refactor-weight-loading' into fix-bnb
00b00448
fix
7d8df526
first
01036272
don't initialize
055aef86