Fix model kwargs #35875

muellerzr merged 45 commits into main from muellerzr-fix-model-kwargs
muellerzr
muellerzr muellerzr requested a review from ArthurZucker ArthurZucker 1 year ago
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker commented on 2025-01-24
Bachstelze
ArthurZucker
ArthurZucker approved these changes on 2025-01-30
ArthurZucker ArthurZucker marked this pull request as ready for review 1 year ago
muellerzr
muellerzr commented on 2025-02-05
muellerzr muellerzr force pushed from cbddbc92 to 6b380f40 1 year ago
muellerzr
ArthurZucker
ArthurZucker commented on 2025-02-06
ArthurZucker
ArthurZucker approved these changes on 2025-02-06
muellerzr Save state
d3c618e1
muellerzr Make a failing test
c4895270
muellerzr Better test
8a58190a
muellerzr mpt -> done, many more to go
4348e364
muellerzr Rm extranious
3b3dfd27
muellerzr Bamba
2bf5390b
muellerzr Bert
34f90601
muellerzr big_bird
39605024
muellerzr biogpt
a87ed159
muellerzr bloom
2705ae63
muellerzr codegen
33e718ba
muellerzr ctrl
e2158481
muellerzr data2vec
72459fa6
muellerzr dbrx
212ee518
muellerzr Through up to Dbrx
81597934
muellerzr electra
f5cf7812
muellerzr ernie
96e26f6a
muellerzr falcon
1ac07d35
muellerzr Fuyu/persimmon
96666914
muellerzr Include noop kwargs to base models
d2d8f8e3
muellerzr Rebase
bf112caa
muellerzr Skip musigen
308b91d4
muellerzr Refactor/skip mllama
ad5e487a
muellerzr Revert makefile
14c121da
muellerzr Rm file
fcf896c9
muellerzr Fix PT failing, need to modify rest of loss funcs to not resize
24b59bfe
muellerzr Propagate some
6320ab43
muellerzr Continue
44530b65
muellerzr More
978dbbe8
muellerzr More options
ea4484e6
muellerzr Mostly fixed
12627ef8
muellerzr Proved that it's the same
dc42e658
muellerzr Bloom is good
9f23ae73
muellerzr Make ability to override loss func possible
12c00f64
muellerzr Fixup
b6fb6066
muellerzr Clean
cfb3bcfd
muellerzr Fix xglm
f7eda3b1
muellerzr Quality tests
6d344199
muellerzr Skip OCR2
c103851d
muellerzr Make specific loss for xglm
bde0bef4
muellerzr Make order the same/line up 1:1
2f951dd2
muellerzr xglm
5204b53b
muellerzr Skip fx output loss bloom model
038dc55a
muellerzr muellerzr force pushed from d93121a3 to 038dc55a 1 year ago
muellerzr Didn't pass in pad_token_id
6033db8e
muellerzr Fix quality
ff06a1dc
muellerzr muellerzr merged 28f73bc3 into main 1 year ago
muellerzr muellerzr deleted the muellerzr-fix-model-kwargs branch 1 year ago
eljandoubi
muellerzr
zheka77111

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone