If no expert is found in a parameter that has "expert" in its name, the loop should continue #7685
LckyLke force pushed from 147ba8d7 to a3520183 30 days ago
fixed some weird behavior in deepspeed which does allow me to import…
b3888f34
Update version.txt after release (#7675)
cb0cde73
[modal ci] fixes (#7676)
e37bdefc
leaf modules: explain better (#7674)
1834cbbd
disable nv-lightning-v100.yml CI (#7681)
ad66ab29
allow separate learning rate "muon_lr" and "adam_lr" for muon optimiz…
ed8c4363
if no expert found in parameter that have expert in name it should co…
78ada8c6
Revert "fixed some weird behavior in deepspeed which does allow me t…
eba17528
LckyLke force pushed from a3520183 to eba17528 30 days ago
Merge branch 'master' into master
d96c92b2
LckyLke marked this pull request as draft 30 days ago
Assignees
No one assigned