DeepSpeed
1ef9b029 - stage_1_and_2: optimize clip calculation to use clamp (#5632)

Commit
1 year ago
stage_1_and_2: optimize clip calculation to use clamp (#5632) instead of "if" that causes host/device synchronization and introduces a bubble, while clamp is hapenning on the device
Author
Parents
Loading