Megatron-DeepSpeed
f4c7c67e
- fix: use deepspeed param count method
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
fix: use deepspeed param count method
References
#204 - Compute model param count once
Author
jaketae
Parents
c2d63903
Loading