DeepSpeed
Support fp32 gradaccum for bf16 model
#2566
Merged

delock: allow bf16 model with fp32 gradient accumulation datatype (e1369363)
delock: allow fp32 gradient accumulation and bfloat16 model in amp mode (7c57bd98)
delock requested reviews from jeffra and tjruwase 3 years ago
tjruwase: Merge branch 'master' into gma/support_fp32_gradaccum_for_bf16_model (eda2f83c)
delock: alternative fix for grad accumulation type mismatch. In the case of … (62cad7a1)
tjruwase: Merge branch 'master' into gma/support_fp32_gradaccum_for_bf16_model (19d11394)
tjruwase approved these changes on 2022-12-05
tjruwase merged 06938835 into master 3 years ago
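
For reference, a minimal sketch of how a bf16 model with fp32 gradient accumulation is typically configured in DeepSpeed. The `data_types.grad_accum_dtype` key is assumed to be the config surface for this feature; the exact option name introduced or used by this PR may differ, so treat this as an illustration rather than the authoritative API, and consult the DeepSpeed config documentation.

```python
import torch
import deepspeed

# Hypothetical config: model and optimizer states in bfloat16,
# gradients accumulated in fp32 (assumed key: data_types.grad_accum_dtype).
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "gradient_accumulation_steps": 4,
    "bf16": {"enabled": True},
    "data_types": {"grad_accum_dtype": "fp32"},
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
}

# Placeholder model purely for illustration.
model = torch.nn.Linear(512, 512)

engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```

The intent of accumulating in fp32 while keeping the model in bf16 is to avoid precision loss when many micro-batch gradients are summed over the accumulation window.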