Megatron-DeepSpeed
2d9744f2
- Alternative fix to TP > 1 (#178)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Alternative fix to TP > 1 (#178) * Only tp_rank_0 need to load dataset, we broadcast necessary meta data to the other ranks * Collapse None and empty lists as well as empty dataloader and eval_iters ==0 * Woops * Add plural to valid and test
References
#178 - Alternative fix to TP > 1
Author
thomasw21
Parents
b982e040
Loading