DeepSpeed
Add Feature Universal Checkpoint for AutoTP
#7908
Merged

Copilot Initial plan
001f77c3
Copilot Revert "fix: update 1 file reformatted."
b90aee5a
nathon-lee Merge pull request #5 from nathon-lee/copilot/git-revert-ff886701
b6da9afd
nathon-lee Merge branch 'deepspeedai:master' into master
bb7f64fd
Copilot Initial plan
cbc816c9
Copilot Reapply "fix: update 1 file reformatted."
5fcc9a7e
nathon-lee Merge pull request #6 from nathon-lee/copilot/remove-commits-from-master
f7c5d75d
nathon-lee feat: Refactor AutoTP universal checkpoint metadata schema handling
0513f4a5
nathon-lee fix: update unit test file test_autotp_universal_checkpoint.py
6bfea514
nathon-lee fix: update unit test file test_autotp_universal_checkpoint.py
5ab684d9
nathon-lee requested a review from tjruwase 27 days ago
nathon-lee requested a review from tohtana 27 days ago
nathon-lee requested a review from hwchen2017 27 days ago
nathon-lee requested a review from loadams 27 days ago
chatgpt-codex-connector commented on 2026-03-16
nathon-lee changed the title from "Add Feature Universal Checkpoint autotp" to "Add Feature Universal Checkpoint for AutoTP" 27 days ago
PawnOfDelock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
delock commented on 2026-03-17
inkcherry commented on 2026-03-17
nathon-lee Add constant for AutoTP universal-checkpoint metadata key
90e30f14
nathon-lee Remove redundant callable() guard for _mark_uc_metadata hook
3f4ecc73
delock commented on 2026-03-18
delock commented on 2026-03-18
delock commented on 2026-03-18
nathon-lee tests: cover uneven sub_param_sizes in AutoTP UC restore
2bf8402c
nathon-lee fix: update some logic for _resolve_autotp_partition
0f8e4ff8
nathon-lee docs: update universal checkpointing and AutoTP checkpoint docs
6c8510a8
delock Merge branch 'master' into feat_uc_autotp
4c81483e
delock approved these changes on 2026-03-23
nathon-lee tests: avoid pytest import file mismatch by renaming AutoTP UC test
e09f5d19
nathon-lee fix: Automatically fix file end line breaks and formatting issues
7ac0316b
delock Merge branch 'master' into feat_uc_autotp
a3f96edc
delock merged f2bb1ec6 into master 20 days ago
