Megatron-DeepSpeed
f4c7c67e - fix: use deepspeed param count method

Commit
4 years ago
fix: use deepspeed param count method
Author
Parents
Loading