Megatron-DeepSpeed
[WIP] [fp32 checkpoint] very early experiments with extracting fp32 params
#112
Open

Loading