Megatron-DeepSpeed
a3ef7783
- Merge remote-tracking branch 'origin/main' into ds_ckpt_reshape
Commit
3 years ago
Merge remote-tracking branch 'origin/main' into ds_ckpt_reshape
References
#239 - Reshape deepspeed checkpoint
#289 - No-ZeRO reshaping
#292 - a branch combining layer-norm-auto-sync and ds_ckpt_reshape
Author
stas00
Parents
29ca2bcc
908dc9cb