Megatron-DeepSpeed
Expose GLU activations as arguments
#69
Merged

Expose GLU activations as arguments #69

jaketae merged 15 commits into main from activation-args
jaketae
jaketae feat: expose glu activations as argument
3ba7882d
jaketae jaketae requested a review from thomasw21 thomasw21 4 years ago
jaketae
thomasw21
thomasw21 commented on 2021-08-18
thomasw21
thomasw21 commented on 2021-08-18
jaketae chore: rename activations -> glu_activations
f004a5aa
jaketae refactor: use lookup dict instead of `getattr()`
c064e2e0
thomasw21
thomasw21 commented on 2021-08-18
jaketae
thomasw21
jaketae refactor: mv lookup dict to `glu_activations.py`
2944fe43
jaketae Merge branch 'main' into activation-args
7d04752e
jaketae chore: rm unnecessary default arg
3729ebc2
jaketae Merge branch 'activation-args' of github.com:bigscience-workshop/Mega…
04e397e1
jaketae
stas00
stas00 requested changes on 2021-08-22
stas00
jaketae test: add bf16 test; gelu in `test_training_all()`
b3c6bbe3
jaketae
stas00
stas00 commented on 2021-08-22
stas00
stas00 commented on 2021-08-22
jaketae Update megatron/testing_utils.py
ff248f0a
jaketae refactor: use `require_torch_bf16` decorator
0056ab7d
stas00
jaketae
stas00
stas00
jaketae
jaketae chore: comment out bf16 test
4097af6f
stas00 consistent style
dc61f89e
stas00 fix look up table
6798e819
stas00
stas00 approved these changes on 2021-08-22
stas00 better grouping
ba8f4041
stas00
stas00 commented on 2021-08-22
jaketae fix: replace hard coded options with `GLU_ACTIVATIONS`
0f36f705
jaketae jaketae merged b5a029d9 into main 4 years ago
jaketae jaketae deleted the activation-args branch 4 years ago
jaketae
stas00

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone