Megatron-DeepSpeed
c7f20066 - Sync lp/hp/optim for layer norms

Commit
3 years ago