DeepSpeed
9f0e2136 - compute global norm on device (#5125)

Commit
1 year ago
compute global norm on device (#5125) Avoid host synchronization by keeping data on device --------- Co-authored-by: Logan Adams <loadams@microsoft.com> Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Author
Parents
Loading