Fix for checkpoint rename race condition #28364
Changed logic for renaming staging directory when saving checkpoint t…
728231ce
Updated styling using make fixup
f2452fda
Updated check for main process to use built-in versions from trainer
3c1c92fa
Fixed incorrect usage of trainer main process checks
df9f6e9a
Removed "with open" due to not working with directory. os.open seems …
eb586987
tblattner
force pushed
to
eb586987
2 years ago
muellerzr
approved these changes
on 2024-01-09
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub