bigscience-workshop/Megatron-DeepSpeed
Commits on branch olruwase/sync_layer_norms

Branches:
LS/alibi
LS/doc
Lucile/add-eval-only-arg
Lucile/delete_unnecessary_brackets
Lucile/useless-parenthesis
add-valid-data
bitfit
bloom-ds-inference-repos2
bloom-inference-meta
bnb-resume-2x
bseval_harness
cc-concurrency
chpt-conversion-fix
ckptavg
cluster_benchmark
consumed_samples_per_valid_dataset
cyclic_valid_dataloaders
debug_with_new_dataset
dependabot/pip/black-24.3.0
ds_ckpt_reshape-with-layer-norm-auto-sync
ds-version-check
fix-sample-ids
fp32-checkpoint-extraction
gpu-direct
hadyelsahar/main
launch-debug
license
log-grad-norm
lumi_eval
lumi_mtf
main
master
megatron-2.4-ds-pipe
mtf_p3
mtf-multival
new-dataset
no-shuffling-option
nozero_reshape
olruwase/ds_ckpt_reshape
olruwase/sync_layer_norms
prefixbseval
preprocess_from_HF_dataset
rm-duplicate-param-count
samson/spm
scratchpad
self_attention_stable_corby
skip-broken-tests
sync4
t0loading
test-conversion
thomas/add_shared_t5
thomas/evaluate_gpt_on_prefix_lm_loss
thomas/evaluate_gpt_speed_if_we_pass_attention_mask
thomas/fix_installation
thomas/fix_layer_norm
thomas/improve_test_to_test_custom_kernel
thomas/mlm_train_script
thomas/opt
thomas/test_different_layer_norm
tp-ln-debug
tr1-13B
tr8-104B
train-no-eval-restart
training_flos_rebase
training_flos
universal_ckpt_info
universal_to_fp32_checkpoint
val_args
5b368846 · Merge remote-tracking branch 'origin/main' into olruwase/sync_layer_norms · stas00 · 3 years ago
3ab0ad18 · Eval harness (#212) · DanielHesslow · 3 years ago · Verified
55f8cf8b · Fixed MLM dataset arguments (#290) · thomasw21 · 3 years ago · Verified
9d264312 · Mlm adaptation (#287) · Lintang Sutawika · 3 years ago · Verified
987663c1 · Fix DS init (#285) · Quentin-Anthony · 3 years ago · Verified
e23393fb · Fix tflops glu computation (#283) · Muennighoff · 3 years ago · Verified
cb48bd2c · [valid] deadlock workaround (#282) · stas00 · 3 years ago · Verified
908dc9cb · Fix mixed fused layer norm to mimic nn.LayerNorm for torch>1.11 (#281) · thomasw21 · 3 years ago · Verified
c85b7c25 · Update CODEOWNERS · TevenLeScao · 3 years ago · Verified
b6266f55 · Update CODEOWNERS · TevenLeScao · 3 years ago · Verified
c6f22c41 · Update CODEOWNERS · TevenLeScao · 3 years ago · Verified
89c343f7 · Create CODEOWNERS · TevenLeScao · 3 years ago · Verified
4c13c617 · tweak the doc · stas00 · 3 years ago
40bd933d · add start-fast doc (#278) · stas00 · 3 years ago · Verified
475f3730 · Fix device issue when using torch.broadcast · thomasw21 · 3 years ago
d576775c · Sync torch_rng_state (#277) · thomasw21 · 3 years ago · Verified
640c8180 · add stop alarm instructions · stas00 · 3 years ago · Verified
2ac141b1 · typo · stas00 · 3 years ago
86b726cb · improve the doc, and comment out the demo · stas00 · 3 years ago
84825956 · fp32 accessors · tjruwase · 3 years ago
bf7eeb3a · add 2 more weights to sync · stas00 · 3 years ago
d64a947e · compare on cpu · stas00 · 3 years ago
d2aa4f18 · add the test script · stas00 · 3 years ago
4443e6d2 · fix regex · stas00 · 3 years ago
fc8f813d · dynamically discovered layer norm weights / refactor · stas00 · 3 years ago
8f2ea60b · fix requirements · stas00 · 3 years ago
c7f20066 · Sync lp/hp/optim for layer norms · tjruwase · 3 years ago
a5b5edc0 · fix · stas00 · 3 years ago
3c5e4914 · Merge remote-tracking branch 'origin/main' into thomas/test_different_layer_norm · stas00 · 3 years ago
87a9dba0 · `torch.testing.assert_equal` didn't make it (#273) · stas00 · 3 years ago · Verified