Megatron-DeepSpeed
Prefix lm
#52
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
98
Changes
View On
GitHub
Prefix lm
#52
thomasw21
merged 98 commits into
bigscience-workshop:main
from
thomasw21:prefix_lm
ICT zeroshot evaluation code
8b04e0e4
made more generic, aligned with other tasks
661553f6
Fixed based on review recoemmendation
43c9137b
fixed another issue
40565390
Merge branch 'main' into main_retriver_merge_ict_eval
a5acbf53
implementing DPR
10ff0607
Merge branch 'main' into main_retriver_merge_dpr
cdde4338
implementation dpr
06076c7a
Merge branch 'main' into main_retriver_merge_dpr
957d1c9a
adding dpr code
b9fcb7b4
removed commnets
80047314
removed commnets
f415dc85
removed commnets
a8d172b3
DPR evaluation debugging
220637f9
DPR ongoing
d2d5086e
DPR finetune and evaluation
6d03d7af
fixing model evaluation of retriver
f9267205
added pre ad post process
54093417
added pre ad post process
7e335e15
evaluation works!
f64977fd
debugging DPR
dca47cfb
fix copy-n-paste error
3f755374
Typo fix in readme
07ca952e
t5 fixes
2dae74b6
Merge branch 'main' into main_retriver_merge_dpr
4a09bb3c
before cleaning the comments
7a0710ec
vit pipeline fixes
ccae9dbd
cleaning the code
2eaf6c79
additional cleaning
25293807
renaming the folders
8e44d619
Add temporary assert to finetuning until it can be fixed.
113c6362
Fixed issues with ICT pretraining
7577931b
updated the evaluation script for retriver
dfb6a9b3
updated the evaluation script for retriver
f21a6628
updated the evaluation script for retriver
a41e4781
updated the evaluation script for retriver
825375cf
Merge branch 'finetune_assert' into 'main'
217f54b3
added exit interval for finetuning
d078e54a
updating the scripts
63121a9e
updating no load rng
fda81a21
Merge branch 'vit_pipeline_fixes' into 'main'
01fc0833
Merge branch 'main_retriver_merge_dpr' into 'main'
83c4d95a
updating script
c7c65bbb
Merge branch 'main' into main_retriver_merge_dpr
84eb016c
Update T5 scripts
3dadd16d
resolved hang issue
04c79f30
fixed the tensor size miss-mass issue
ebfbfcec
fixed the evaluation hangs
e46f3260
Adding readme
a983cab3
Adding readme
d562d7b5
Adding readme
1095d7e6
Adding readme
bab5cc4e
Adding readme
8661ca26
Adding readme
293554aa
Adding readme
e287bf00
Adding readme
c45109ed
Clean up README.md a bit
473127f9
addressed comments
28450473
Merge branch 'main_retriver_merge_dpr' of ssh://gitlab-master.nvidia.…
98113c69
Merge branch 'main_retriver_merge_dpr' into 'main'
598d7ee2
Merge branch 't5_scripts' into 'main'
2be1e510
updated readme
9d350c9c
updated readme
baf2e2a4
updated readme
32da2e78
updated readme
4c92ca82
Merge branch 'main_retriver_merge_dpr' into 'main'
82b69e86
Merge branch 't5' of https://github.com/stas00/Megatron-LM into githu…
7898c9a4
Merge branch 'typo-fix' of https://github.com/devrimcavusoglu/Megatro…
e1318f0e
Merge branch 'patch-1' of https://github.com/stas00/Megatron-LM into …
4a35d50a
Merge branch 'github-pr' into 'main'
90e0a0dd
Basic handling of prefix lm by updating the mask
b9fbe3db
Add prefix option to gpt temporarily and prevent it to use custom kernel
f69a0023
Add argument for prefix lm, in order to configure masking strategy
c63eb38f
Woops
bc5313bf
thomasw21
commented on 2021-08-05
loss_on_targets_only flag, assert that current prefix implementation …
60a5884d
Format
fa134edf
huu4ontocord
commented on 2021-08-05
huu4ontocord
commented on 2021-08-05
Reverse renaming
3b678fe0
Allow prefix on partial document at the end
fe5f9b6a
WIP: add prefix per row feature
a3840d79
Document the use of None
e4547612
Woops
150b5a1b
Handle empty document better
3f9efbbd
We might not be able to concat empty tensors
9802ca6f
Handle empty tensor seperately
fcb34b39
Debug
8e43c960
Test
3d004ef9
Add loss masking as script argument
675ef50e
huu4ontocord
commented on 2021-08-05
thomasw21
assigned
thomasw21
4 years ago
thomasw21
added
arch&scale
thomasw21
added
enhancement
Turns out deepspeed integration of attention matrices prevented dynam…
e304b218
Add more asserts
30fddd22
thomasw21
commented on 2021-08-06
thomasw21
commented on 2021-08-06
Prefix can only see the prefix, it cannot see target
23397ad0
thomasw21
changed the title
WIP: Prefix lm
Prefix lm
4 years ago
thomasw21
marked this pull request as ready for review
4 years ago
Remove prefix-lm argument as we split the pretrain script
28a712d4
ibeltagy
requested changes on 2021-09-07
ibeltagy
requested changes on 2021-09-15
Merge remote-tracking branch 'origin/master' into prefix_lm
d49d6e5e
Iz PR review
bbfac966
Make masking row dependent when using prefix
0cdb0a94
Revert "Merge remote-tracking branch 'origin/master' into prefix_lm"
a7c51aa1
Make asserts concerning the choice on loss_on_targets_only
6db9b970
Tests (#1)
295e8d0d
ibeltagy
requested a review
from
ibeltagy
4 years ago
ibeltagy
approved these changes on 2021-09-16
Update code for prefix lm
6a96fb9c
thomasw21
merged
68b46f20
into main
4 years ago
Login to write a write a comment.
Login via GitHub
Reviewers
ibeltagy
sbmaruf
huu4ontocord
Assignees
thomasw21
Labels
enhancement
arch&scale
Milestone
No milestone
Login to write a write a comment.
Login via GitHub