DeepSpeed
7f0950f8 - DeepSpeedZeroOptimizer_Stage3: remove cuda specific optimizer (#5138)

Commit
1 year ago
DeepSpeedZeroOptimizer_Stage3: remove cuda specific optimizer (#5138) during cpu offload there was a usage in cuda fused adam, as a backup optimizer. This is a specific accelerator code, also in non-critical path. --------- Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Author
Parents
Loading