[model weights] zero_to_fp32 multiple improvements #1181
add live zero checkpoint to fp32 consolidation version
1ff9e2ea
some more docs
ad578e1c
zero2 model states uses a different filename
1acd0741
fix
3f25d286
Merge remote-tracking branch 'origin/master' into z2fp32-auto
3b3282fb
make debug mode cli configurable
6bef851a
copy the script only on node 0 process 0
440e298d
validate that we have the right number of files
673f37b7
revamp _get_zero_param_shapes, instrument with easier debug
17d6a200
stas00
changed the title from "[WIP] [model weights] add live zero checkpoint to fp32 consolidation version" to "[WIP] [model weights] zero_to_fp32 multiple improvements" 4 years ago
correct assertion
88e48820
rename API; add even simpler API
50d42c39
style
12e61a0f
docs improve
de79132f
stas00
changed the title from "[WIP] [model weights] zero_to_fp32 multiple improvements" to "[model weights] zero_to_fp32 multiple improvements" 4 years ago
update the docs
24476cf1
Merge branch 'master' into z2fp32-auto
c86be9aa
stas00
commented
on 2021-07-09
Merge branch 'master' into z2fp32-auto
99d25fb3
stas00
commented
on 2021-07-09
Merge remote-tracking branch 'origin/master' into z2fp32-auto
ee7d6a7a
Merge branch 'master' into z2fp32-auto
6bc3ce21
Merge branch 'master' into z2fp32-auto
4b52e401
tjruwase
approved these changes
on 2021-07-12
revert the unpartitioned_params detection and report as it's most lik…
70adb32a
Merge branch 'master' into z2fp32-auto
0dd33e9d
tjruwase
merged
2a921069
into master 4 years ago
stas00
deleted the z2fp32-auto branch 4 years ago