transformers
🚨 [v5] Refactor RoPE for layer types
#39847
Merged

🚨 [v5] Refactor RoPE for layer types #39847

zucchini-nlp
zucchini-nlp update
30aaa216
ArthurZucker
ArthurZucker commented on 2025-08-05
zucchini-nlp batch update model code
0e5d07bf
zucchini-nlp typos
0712f629
zucchini-nlp too many diffs, dump
4fc73559
zucchini-nlp dump again
b6162401
zucchini-nlp another dump
06cd2a87
zucchini-nlp fix copies
f66ad573
zucchini-nlp make `rope_scaling_dict` self attr
7dc077ff
zucchini-nlp fix a few more tests
4ac0f189
zucchini-nlp another update
9ad42e93
zucchini-nlp fix a few more tests, hopefully last ones
98944d5d
zucchini-nlp fox copies
12137694
zucchini-nlp a huuuge merge conflict resolved!
d787da72
HuggingFaceDocBuilderDev
zucchini-nlp fix copies again
00d4b3d2
zucchini-nlp fix newly added models, I hate rebasing on main
f9d4de3f
zucchini-nlp update config files
d695f5a1
zucchini-nlp modular files
303f218f
zucchini-nlp fix rope utils test
3229fbae
zucchini-nlp docstring has to be indented more, why?
1914d826
zucchini-nlp oops forgot to update some modualr files
fccb637e
zucchini-nlp copy from doesn't copy decorators?
709c414e
zucchini-nlp fix overriden test as well
b00d90c3
zucchini-nlp add a new test
c8120cf6
zucchini-nlp fix failing tests again
a352362f
zucchini-nlp update docstrings
2f54cb36
zucchini-nlp fix phi3
11edd474
zucchini-nlp Merge branch 'main' into rope-refactor-version-2
6bc850d3
zucchini-nlp fix two models
206f03d1
zucchini-nlp fix copies
8a9085fd
zucchini-nlp forgot to add
88328d2a
zucchini-nlp
github-actions
zucchini-nlp
zucchini-nlp
zucchini-nlp zucchini-nlp requested a review from ArthurZucker ArthurZucker 160 days ago
zucchini-nlp stupid bug from modular conversion
53247489
zucchini-nlp Merge remote-tracking branch 'upstream/main' into rope-refactor-versi…
62518de1
ArthurZucker
ArthurZucker commented on 2025-08-21
ArthurZucker
ArthurZucker
zucchini-nlp fix slow tests
01e79a89
zucchini-nlp
zucchini-nlp
github-actions
ArthurZucker
zucchini-nlp
ArthurZucker
zucchini-nlp update to call rotary emb once per model forward
1a4ccc76
zucchini-nlp 3K tests failing?!
13d30ad9
zucchini-nlp update
df6a3c56
zucchini-nlp update more models
5c129c53
zucchini-nlp fix copies
c54bf4a1
zucchini-nlp fix the rest of tests hopefully
9b2b3577
zucchini-nlp merge main
373a8f37
zucchini-nlp fix after rebase
54ad1dce
zucchini-nlp fix the rope tests
c576edb0
zucchini-nlp fix docs omni
dc9ad8d0
zucchini-nlp change a bit
686b1a7e
zucchini-nlp models with layer types
182a6001
zucchini-nlp why it was deleted?
fdf68186
zucchini-nlp fix a few tests
c786413a
zucchini-nlp fix last test!
722541f3
zucchini-nlp delete extra empty lines
6a7321e3
zucchini-nlp add a test case
5a9cb987
zucchini-nlp more changes
403f56fd
zucchini-nlp fix models
a6c41247
zucchini-nlp typing hint for nested rope params
193eb23f
zucchini-nlp merge main
d1eeb420
zucchini-nlp missed when resolving conflicts
235ef331
zucchini-nlp
zucchini-nlp zucchini-nlp requested a review from gante gante 123 days ago
zucchini-nlp zucchini-nlp changed the title [WIP] RoPE refactor Refactor RoPE for layer types 123 days ago
zucchini-nlp
zucchini-nlp delete layer types and fix typo
6efc1832
zucchini-nlp fix copies
cf7097d4
zucchini-nlp fix copies
234f2333
zucchini-nlp
gante
gante commented on 2025-10-01
zucchini-nlp update docs text
a425778b
zucchini-nlp docs
8886f85b
zucchini-nlp huuge update all models
52192d9d
zucchini-nlp fix copies
74a4b4f3
zucchini-nlp rename attr to align with new format
362e88db
zucchini-nlp delete redundant rope tests
f458fd06
zucchini-nlp trigger ci
a7cf9928
zucchini-nlp merge main
05cd1ec1
zucchini-nlp update the case
f3172faf
zucchini-nlp this is why i hate rebasing
5739b6eb
zucchini-nlp maybe fixed?
ecde27c7
zucchini-nlp oops
9ecfa5ee
zucchini-nlp now fix?
5e379579
zucchini-nlp fix last tests and copies
2b279e07
zucchini-nlp
zucchini-nlp zucchini-nlp requested a review from gante gante 110 days ago
zucchini-nlp
gante
zucchini-nlp
zucchini-nlp merge main
96172118
gante
gante approved these changes on 2025-10-08
zucchini-nlp fix copies?
878a9335
zucchini-nlp fix minimax and gemma3n
617f1aec
zucchini-nlp update typo
b983014b
zucchini-nlp deprecation end version
f7c50438
zucchini-nlp final fix copies :fingers-crossed:
07fa6303
zucchini-nlp oh my, add the docs in toctree
89beae3a
zucchini-nlp oke, this is really the last fix
721de32a
zucchini-nlp
zucchini-nlp zucchini-nlp changed the title Refactor RoPE for layer types 🚨 [v5] Refactor RoPE for layer types 108 days ago
zucchini-nlp kill me please...
d530a860
zucchini-nlp fix copies and hope that tests won't start failing again
a5b397ba
zucchini-nlp use rope scaling if saved
c66337c5
zucchini-nlp fix slow tests
1bfc0133
gante
zucchini-nlp fix cwm and unrelated deepseek
baf23d96
zucchini-nlp Merge branch 'main' into rope-refactor-version-2
3de40c3f
zucchini-nlp fix last
87278a9d
zucchini-nlp
github-actions
ArthurZucker
ArthurZucker commented on 2025-10-13
zucchini-nlp
zucchini-nlp update
9dcd94d8
zucchini-nlp hope it works now, it took so long
dca1b94f
zucchini-nlp lets keep None for now, I will try to remove after checking tests
fcdff3ba
zucchini-nlp some more fixes, i find and replace does not always find all cases
9f6d9633
zucchini-nlp last fix of tests
ff23a682
ydshieh Merge branch 'main' (commit b84c0b31) into rope-refactor-version-2
e4617dc1
zucchini-nlp arthur's comment for extra foreward kwargs
3c955d2c
zucchini-nlp delete unused code
b4186d1f
ydshieh
ydshieh
github-actions
zucchini-nlp Merge branch 'main' into rope-refactor-version-2
cd175981
zucchini-nlp
github-actions
ArthurZucker
ArthurZucker commented on 2025-10-16
zucchini-nlp fix slow qwen tests
717ddc6a
zucchini-nlp delete layer types from models
5fdae079
zucchini-nlp faulty modular conversion
51ca43ab
zucchini-nlp fix qwen omni
080a7423
zucchini-nlp merge main
10493054
zucchini-nlp fix copies and style
5f126b71
github-actions
ArthurZucker
ArthurZucker approved these changes on 2025-10-16
zucchini-nlp
zucchini-nlp commented on 2025-10-16
zucchini-nlp
zucchini-nlp commented on 2025-10-16
zucchini-nlp
zucchini-nlp commented on 2025-10-16
zucchini-nlp
zucchini-nlp address my comment
1cba3b83
zucchini-nlp
github-actions
zucchini-nlp
github-actions
zucchini-nlp
zucchini-nlp zucchini-nlp merged 10de06da into main 100 days ago
BakerBunker
ydshieh
zucchini-nlp zucchini-nlp added for_v5?

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone