Megatron-DeepSpeed
bdd75f18
- 1B3 parameter setup + flos counting
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
1B3 parameter setup + flos counting
References
#40 - Floating-point ops counting and reloading
Author
TevenLeScao
Parents
a3a4ba46
Loading