transformers
fix overflow when training mDeberta in fp16
#24116
Merged

fix overflow when training mDeberta in fp16 #24116

sjrl
sjrl Porting changes from https://github.com/microsoft/DeBERTa/ that hopef…
04651857
sjrl Updates to deberta modeling from microsoft repo
b367defb
sjrl Performing some cleanup
9c056f23
sjrl Undoing changes that weren't necessary
ba8f2ade
sjrl Undoing float calls
3492dd70
sjrl Minimally change the p2c block
b75fbd8a
sjrl Fix error
b5b697ab
sjrl Minimally changing the c2p block
6d69c7fb
sjrl Switch to torch sqrt
0ea34591
sjrl Remove math
3c95c8a1
sjrl Adding back the to calls to scale
dd8bd345
sjrl Undoing attention_scores change
b930014d
sjrl Removing commented out code
9d22fde5
amyeroberts
sjrl Updating modeling_sew_d.py to satisfy utils/check_copies.py
f9d52efd
ArthurZucker
ArthurZucker commented on 2023-06-08
sjrl Missed changed
c90cde89
HuggingFaceDocBuilderDev
sjrl Further reduce changes needed to get fp16 working
9969b99d
sjrl Reverting changes to modeling_sew_d.py
1ec5df7b
sjrl
ArthurZucker
ArthurZucker approved these changes on 2023-06-09
ArthurZucker ArthurZucker requested a review from amyeroberts amyeroberts 2 years ago
sjrl
amyeroberts
Rocketknight1
younesbelkada
younesbelkada approved these changes on 2023-06-12
sjrl
Rocketknight1
sjrl Make same change in TF
0c9ca818
amyeroberts
amyeroberts approved these changes on 2023-06-13
amyeroberts
sjrl
amyeroberts amyeroberts merged 3e142cb0 into main 2 years ago
sjrl sjrl deleted the mdeberta-fp16-overflow branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone