DeepSpeed
1ef9b029
- stage_1_and_2: optimize clip calculation to use clamp (#5632)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
stage_1_and_2: optimize clip calculation to use clamp (#5632) instead of "if" that causes host/device synchronization and introduces a bubble, while clamp is hapenning on the device
References
#5632 - stage_1_and_2: optimize clip calculation to use clamp
Author
nelyahu
Parents
6e2899fb
Loading