Enable rocm-support #353

luukkonenr wants to merge 43 commits into bigscience-workshop:main from TurkuNLP:main
luukkonenr
luukkonenr Squash 3 commits to 1
ebb79c86
spyysalo Add --no-layer-norm-fusion argument
21c90de1
spyysalo Add --no-optimizer-fusion argument
e0487132
spyysalo Bugfix (thanks to Thomas Wang for catching this)
18e2c65b
hubertlu-tw Fix the bug of FusedLayerNorm on ROCm (#96)
9b7cd052
spyysalo Revert cherry-picked changes to .py
277e1d38
Muennighoff Add LUMI eval compat
2963caea
Muennighoff Update tasks
32f039c2
spyysalo Merge pull request #1 from bigscience-workshop/lumi_eval
fdd57c4b
NouamaneTazi add inverse_sqrt lr decay style
2ca2338c
NouamaneTazi fix no warmup case
ad60932f
NouamaneTazi use t5x formula
0823ad8c
NouamaneTazi avoid num_steps > decay_steps case
a093db6f
NouamaneTazi remove casting as math.sqrt does that
b4601b9e
NouamaneTazi add lr-warmup-style argument taking "constant" or "linear" values
4dae1399
NouamaneTazi refactor num_steps_
5fbb1dd5
NouamaneTazi docs
6299fb24
NouamaneTazi fix formulas
4e866509
NouamaneTazi fix formula
50c69359
NouamaneTazi correct comment
5c642dd3
NouamaneTazi note about replicating t5x
1b14a28c
NouamaneTazi Merge pull request #2 from NouamaneTazi/inverse-sqrt-lr
5e811b66
NouamaneTazi quick fix for upper triang masked softmax cuda kernel for seq_len < 8192
5365f41f
spyysalo Merge pull request #3 from NouamaneTazi/large-seqlen-kernels
98749637
spyysalo Use torch.multiprocessing.set_start_method('spawn')
c41cc5e0
spyysalo skip_warmup on __setstate__
6732bc9b
Muennighoff Copy preliminary UL2
ab29faf5
Muennighoff DeepSpeed compat
9328ad2c
Muennighoff DS Group compat
351f4f24
Muennighoff Adapt eval for denoiser
abc19b83
Muennighoff Simpler padding
816c32d1
Muennighoff Fix sampling
bdbd54a0
Muennighoff Switch padding
cacf267c
spyysalo Merge pull request #4 from TurkuNLP/ul2
47691320
Muennighoff Upate sampling
557b09ce
Muennighoff Update UL2
a6f69bf2
Muennighoff Add get_samples_mapping
d0d277fe
Muennighoff Import math
3f29df89
Muennighoff Fix prefixlm
52073863
Muennighoff tmp
9490e50e
Muennighoff Merge branch 'main' into tmp
9c8d02ca
Muennighoff Revert UL2 Tokenizer Changes
6936afba
Muennighoff Merge pull request #7 from TurkuNLP/tmp
a1088c1c

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone