transformers
add53e25 - don't use no_sync when deepspeed doesn't support it for certain zero stages (#35157)

Commit
1 year ago
don't use no_sync when deepspeed doesn't support it for certain zero stages (#35157)

* don't use no_sync when deepspeed doesn't support it for certain zero stages
* chore: lint
* fix no_sync context for deepspeed across all zero types
* chore: lint
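The idea in the message can be illustrated with a minimal sketch. This is not the code from the commit: the function name `pick_sync_context` and its parameters are hypothetical, and the only assumption taken from the message is that the `no_sync` context should be skipped whenever DeepSpeed is in use, regardless of ZeRO stage. During gradient accumulation, gradient synchronization is normally suppressed on every micro-batch except the last; under DeepSpeed the sketch falls back to a no-op context instead.

```python
import contextlib
from contextlib import AbstractContextManager
from typing import Callable


def pick_sync_context(
    is_last_micro_batch: bool,
    uses_deepspeed: bool,
    no_sync: Callable[[], AbstractContextManager],
) -> AbstractContextManager:
    """Choose the context to wrap one micro-batch's forward/backward pass.

    All names here are hypothetical. `no_sync` stands in for whatever
    callable returns the framework's "skip gradient sync" context manager.
    """
    # Assumption from the commit message: never enter no_sync under DeepSpeed.
    # The last micro-batch must synchronize, so it also gets a no-op context.
    if uses_deepspeed or is_last_micro_batch:
        return contextlib.nullcontext()
    return no_sync()
```

In a training loop this would be used as `with pick_sync_context(i == n - 1, uses_deepspeed, make_no_sync): ...`, where `make_no_sync` is whatever factory the surrounding framework provides.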