DeepSpeed
Fix hook count performance regression from v0.18.5
#7886
Merged

tohtana
tohtana Add should_refresh_expected_hook_count and harden enter_backward
e1b99f70
tohtana Fix ZeRO-2 hook to cache count_used_parameters_in_backward result
91b591bc
tohtana Fix ZeRO-3 hooks to cache count_used_parameters_in_backward result
d3a6e996
tohtana
chatgpt-codex-connector
delock commented on 2026-03-05
delock approved these changes on 2026-03-05
mjkvaak-amd
rraminen
tohtana marked this pull request as ready for review 4 days ago
tohtana requested a review from tjruwase 4 days ago
tohtana requested a review from loadams 4 days ago
tohtana Add regression tests for hook count performance fix
e1b41ee5
tohtana Add comment clarifying refresh-before-reenter ordering
c43e4237
tohtana force pushed from fd933f3e to c43e4237 4 days ago
tohtana merged 6c59d544 into master 4 days ago

Assignees: No one assigned