Megatron-DeepSpeed
8e22824e - Fix token alignment, add mpu checkpointing, misc training code

Commit
5 years ago
Fix token alignment, add mpu checkpointing, misc training code
Author
Neel Kant
Parents
Loading