transformers
Fix bnb for the weights refactor
#42043
Merged
ArthurZucker merged 396 commits into main from fix-bnb
small fix
23e3ed74
nits
a8fb5540
ish
ab6ee8ae
up
77ccbb17
rev
8a8beff7
fix more tie weights keys
02386ce7
small fixes
1c87945a
nit
00b95ee0
update
a170f290
fix and fix
8b924a3b
fix a test
8f7b1d02
glubs
93862177
current shitty changes
4894a257
ship validated ones
da7dc100
more
d7c81717
more update
e0884089
more
4f212de4
more
dc5a22c2
more
675b2bca
mllama
f85f2397
more up
76b6a92d
fix ernie
ba1a8b64
fix xopies
ba3de5ad
up more
8fd255c7
more fixes
5d7507b1
up
0fb23403
up
32b92738
fix-copies
0b95826c
fix more
5794d27d
more updates
5e71bd4a
AI UPDATE
20d1b340
up
89846e7d
hoey
a581fd75
make it fast
1652c9c5
fix
dcad7030
lol
c921cede
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
50714d8c
fix asjusting
8936cc40
more fixes
5c54332e
_dtype nit
ff108789
up
9601b82c
nit
db02b9d7
update
42fd4c43
update
45271710
remove semaphores
bd362112
fix import to avoid jit execution
e2aefee7
try to remove custom tiing logic when its stupid
74a0e9c7
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
ead2ac37
fix more individual models
e7165da0
fix whisper as well
2ff765e9
fix?
912562c0
fox umt5
c43495a5
improve tqdm bar
57988f25
cleanup a bit
8c16de16
oupsi
b8927d67
some updates
2733ff69
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
8baa3fe9
improve
d91701f7
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
5146dec4
remove all buffering -> much faster without it
acc5b245
remove some tie_weights custome funcs when not needed
58389a1f
more fixes related to strict matching regex
92c0229a
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d9e7fe65
remove ALL custom tie weights
b57d7897
small update
ef8b6c35
revert change to init scheme (no need for params)
a228fd0a
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
07574ddd
fix
710b1fff
mixtral init
2526cc5d
SunMarc requested a review from MekkCyber 69 days ago
try less strict source check
6cb37940
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
e4cadfb1
tied weight first shot to the fiiiixxxxxx
3fea8658
does this help?
82f94b8a
:)
84dd6eb2
fix some ppolry defined tied_weights_keys for now
cc081954
fixes for more models torch_bc
f72f96d4
nits and fixes
e3415292
last update
0e51decd
Revert "tied weight first shot to the fiiiixxxxxx"
0f022b59
here we go again
1dabb4c3
MekkCyber commented on 2025-11-06
an attempt
0c2b667d
up?
c48e1edb
nits
d2236356
Fix bnb loading !
bdbc01a6
rm print
399388d1
Merge branch 'refactor-weight-loading' into fix-bnb
acbeeae7
SunMarc requested a review from ArthurZucker 67 days ago
SunMarc requested a review from Cyrilvallez 67 days ago
SunMarc changed the title from "Fix bnb on the fly for the weights refactor" to "Fix bnb for the weights refactor" 67 days ago
ArthurZucker commented on 2025-11-06
matthewdouglas commented on 2025-11-06
subclass nn.Parameters
f692f4bd
up
2fa058fe
lol
78d46227
Ouiiii
8ff4ad56
fix led
32226787
fix long cat flash
9a76a6ee
fix qwen and long cat flash
9fde9f78
properly fix qwen init
074a449f
just push this for now
dde5500d
propnet is dumb
0e7d2d05
update
18b02eea
rm import
e16da231
update
386e259b
push
9c0db728
Merge remote-tracking branch 'upstream/refactor-weight-loading' into …
9788014a
Update src/transformers/core_model_loading.py
72eff97c
remove explict sharing of some tied keys.
75d3afcb
update decoder.bias
85ab0859
moe case
443573ae
Fix loadedparam
d841a04b
Merge remote-tracking branch 'upstream/fix-bnb' into fix-bnb
e235eedd
SunMarc commented on 2025-11-07
rm report
e4df7526
more changes to untangle old hardcoded ting
f8f09734
fixup
5c9d56cb
Merge branch 'main' into refactor-weight-loading
a0029f20
fix big faileurs
44943fb8
Fix tests single gpu
3e696222
should fix it
a0525133
fix prophnet
76d66be5
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
d176b489
fix resize token embeddings
3ffc59ef
nits
2a00e493
fix xcodex
f7d0183d
asyncio?
bbf5b000
fix smart apply
04128324
fix data-2-vec
c137ea33
[build-ci-image]
7b7c9903
checkout
de74aebb
uupdate
94a53d4c
Merge branch 'refactor-weight-loading' into fix-bnb
db4fe31d
fix hunyuan
8755a4be
update error message
5be67b96
fix deformable detr
86a4e516
fixes
09bcd2ee
fix init weights for non param gate up projs
7b457fd0
shared todo?
e033947a
guard needed for compressed-tensors
9fa1b7a2
Merge branch 'refactor-weight-loading' into fix-bnb
ea5822db
deal with buffers
5881d8eb
update some models
f93f3570
big revert, don't break this behaviour
2f0a6aed
ty @SunMarc this fixes the buffers
3c8c7572
mt5 fuck
f5a7c33d
fix lxmbert
647f720a
nuke slow test fetcher
bed6ea1c
Merge branch 'refactor-weight-loading' into fix-bnb
36514602
Merge branch 'refactor-weight-loading' into fix-bnb
00b00448
fix
7d8df526
fix zamba and deepcopy for now
2ec0a5fb
fix zamba tied weight keys! ~
f9c7ef87
fix-copies
8df3ffd8
update fetch terst
e76481b9
fix gradient for test modeling common!
de007511
break "shared" for now I will fix tomorrow changes are properly isoal…
cdd1a9b3
does this fix marian? probably not
d3f64762
fix some vlms
0a7db831
D fine seems to handle this well
18142005
glob is fine actually
b77825d3
fix dab detr
5dbb7833
small steps
9edc81b8
opusy
970f4e53
fix some more models?
0361d47d
yups
dc757737
better erro
cdb12846
fix?
de9a2d98
fix double escape
b9a9f4d8
escape wehere it makes sense
c944619e
??
f9105240
fix ibert
4aa2ade0
fix tvp as well
2ef1c2b2
more fxes
b98a7bce
try always download ref PR
74e6c871
ONONONO
5064edd1
big fixup
3f8a304c
more fixup
3ecaa63d
small step
f384524e
small nits
290337a2
nits
76b388c9
brut force some stuff
e69b988e
fix vilt
c2781f57
make sure special models that always need tie always tie
f64ee960
cleaning up
a3e40152
small nits
9eecbd27
ArthurZucker commented on 2025-11-10
fix zamba and bridge tower!
b2fa432b
just fixup
dbbfdf29
potential culprits
ab4890c8
revert bark and fix bridgetower
937ebf36
Merge branch 'main' of github.com:huggingface/transformers into refac…
e4f9697f
remove now non existant tie_weights
17803ce9
?
9f6838a2
lol reformer actually had nothing tied!
1afb3eb5
wow these two fucking models were really not well made
f01a149a
fix sam family!
0b369802
fix bark revision
d740c82b
fix speech2test ?
6f3940ee
push this for now....
b2f6f61a
upsy
ade8dab4
the fuck
f956ccfb
fix rtdetr
99c6fd49
update
1ffcfc3f
proper
ee62aec5
wow that one 's annoying
6ec80f86
update
b05e3290
try to find the culprit
2606596f
get some help on common
d9e8a09d
nit about general init and cls.padding_idx
581665ae
revert num workers update
c43bc687
remove old loading func
b6fe4158
fix glob
4bb8e5c9
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
7d52b063
add annotations
455bcc7c
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
fc884c03
fix re
2e0ed5d2
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
3ddd1cca
small improvements
1f86a104
fix conflict
4d56fbf1
clean some stuff
67a8eebb
improvements
e9168ff5
someone did not understannnnnnd what I tried to dooo or does BNB not …
feda22d9
Merge branch 'refactor-weight-loading' of github.com:huggingface/tran…
70841c9f
gluos
52248ba3
fix case when `.` is just not there
e8dd4a45
for now let's do this
5ce08caa
Merge remote-tracking branch 'upstream/refactor-weight-loading' into …
1a2b5ca5
fix
2f1b69c9
fix small test
3c2d946e
Base automatically changed from refactor-weight-loading to main 61 days ago
Merge remote-tracking branch 'upstream/main' into fix-bnb
64f204c2
style
3a5eadde
fix merge conflits
c319aabd
style
cece17f0
8bit fixed ?
27760373
fix
4d105410
SunMarc requested a review from ArthurZucker 60 days ago
ArthurZucker commented on 2025-11-17
fix 8bit dtype
a574c903
fix
e0f01bac
rm copy
f9a3ae49
Apply suggestions from code review
bf51aa16
style
ebe8b408
test
84dbd079
SunMarc requested a review from ArthurZucker 57 days ago
SunMarc requested a review from MekkCyber 57 days ago
fix
ed4cfb3f
finally ?
f6ec797f
Apply style fixes
1a51bb70
fix
88edf0dd
Merge branch 'fix-bnb' of github.com:huggingface/transformers into fi…
5bf0eef0
Merge remote-tracking branch 'upstream/main' into fix-bnb
a43da182
fix
2bf3a09f
Apply style fixes
11437cbb
ArthurZucker approved these changes on 2025-11-18
tie weights
3d386b45
Merge branch 'fix-bnb' of github.com:huggingface/transformers into fi…
6f4feeaf
Merge remote-tracking branch 'upstream/main' into fix-bnb
2c45d7cf
warning
bcc929cb
Apply style fixes
fa1273a1
init
d1ff2a7e
SunMarc commented on 2025-11-18
SunMarc commented on 2025-11-18
default
6ddbb182
Merge branch 'fix-bnb' of github.com:huggingface/transformers into fi…
e9d8094a
ArthurZucker merged 67302b04 into main 55 days ago
ArthurZucker deleted the fix-bnb branch 55 days ago
Reviewers
ArthurZucker
matthewdouglas
MekkCyber
Cyrilvallez
Assignees
No one assigned
Labels
None yet
Milestone
No milestone