Megatron-DeepSpeed
f75af1f9
- Offload to CPU earlier & increase number of bs in pipleine parallelism
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Offload to CPU earlier & increase number of bs in pipleine parallelism
References
#291 - BigScience Eval Harness
#313 - Prefix LM Eval
Author
Muennighoff
Parents
9af3e02e
Loading