Megatron-DeepSpeed
2e4d03b8 - start using GPTMegLMHeadModel + cleanup

Commit
4 years ago