Reweighting strat for prefix lm #190
First test to unbias the loss for prefix lm
c8d3243f
Woops
226cf712
Add same code for not deepspeed mode
a58c0413
Improve testing
3ce2154b
Woops
74fabfb7
Test moving it inside?
53e64030
This changes the normalization factor in loss computation
2981380b
Fix
8a83121f
Woops
de8c56d4
Better refactoring of loss normalization
48953683
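The commit titles above mention a change to the normalization factor in the loss computation, but the diff itself is not shown here. As a rough illustration only, assuming a prefix LM where a 0/1 loss mask excludes prefix tokens from the loss, the sketch below contrasts two common normalization choices: dividing by the total number of target tokens in the batch, and averaging per-sequence means. The function names and toy data are hypothetical and not taken from this PR.

```python
# Hypothetical sketch: two ways to normalize a masked LM loss.
# Not the PR's actual code; names and data are illustrative assumptions.

def masked_mean(token_losses, loss_mask):
    """Mean of the per-token losses at positions where loss_mask is 1."""
    total = sum(loss * m for loss, m in zip(token_losses, loss_mask))
    count = sum(loss_mask)
    return total / count if count else 0.0


def token_normalized_loss(batch_losses, batch_masks):
    """Sum masked losses over the whole batch, divide by total target tokens.

    Sequences with more target tokens contribute more to the result.
    """
    total = sum(
        loss * m
        for losses, mask in zip(batch_losses, batch_masks)
        for loss, m in zip(losses, mask)
    )
    count = sum(sum(mask) for mask in batch_masks)
    return total / count if count else 0.0


def sequence_normalized_loss(batch_losses, batch_masks):
    """Normalize each sequence by its own target count, then average.

    Every sequence contributes equally, whatever its prefix length.
    """
    per_sequence = [masked_mean(l, m) for l, m in zip(batch_losses, batch_masks)]
    return sum(per_sequence) / len(per_sequence)


# Toy batch: sequence A has a long prefix (1 target token),
# sequence B has a short prefix (3 target tokens).
batch_losses = [[5.0, 5.0, 5.0, 1.0], [5.0, 3.0, 3.0, 3.0]]
batch_masks = [[0, 0, 0, 1], [0, 1, 1, 1]]

token_level = token_normalized_loss(batch_losses, batch_masks)
sequence_level = sequence_normalized_loss(batch_losses, batch_masks)
```

With this toy batch the two normalizations disagree, because the sequence with the shorter prefix carries more target tokens. That kind of position-dependent imbalance is what a reweighting scheme would need to account for.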
thomasw21 merged b3cf1755 into main 4 years ago
thomasw21 deleted the thomas/reweight_tokens_depending_on_their_position branch 3 years ago
Assignees
No one assigned