DeepSpeed
a7118789 - Fix issue where gradient_predivide_factor was called as a func. (#996)

Commit
4 years ago
Fix issue where gradient_predivide_factor was called as a func. (#996) * Fix issue where gradient_predivide_factor was called as a func. `gradient_predivide_factor` is a `float`, hence shouldn't be called as func. This crashes when `reduce_scatter` flag is set to `False`.
Author
Parents
Loading