DeepSpeed
Skip empty parameters in gradient reduction
#7789
Merged

tohtana
tohtana fix: skip empty parameters in gradient reduction
f8a609a3
tohtana requested a review from tjruwase 48 days ago
PKUWZP
PKUWZP approved these changes on 2026-01-18
tohtana merged 114f971c into master 45 days ago
