Megatron-DeepSpeed
[WIP] dealing with multi-process noise
#193
Merged

[WIP] dealing with multi-process noise #193

stas00 merged 10 commits into main from silence-please
stas00
stas00 deal with tokenizer logs
2d3ae7c7
stas00 Trigger CI
a95f8730
stas00 fused_kernels.load noise
b815d99c
stas00 control ds replica noise
c0993d6f
stas00 control ds replica noise
e5b84cc7
stas00 ds info log on rank0 only
e89cf790
stas00 only on rank 0
cb5e4911
stas00 only if --deepspeed
5ae56140
stas00 fix
e67c17fb
stas00 replica logging for transformers
511dffb4
stas00 stas00 merged 5d3150c7 into main 4 years ago
stas00 stas00 deleted the silence-please branch 4 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone