DeepSpeed
Fix missing scale attributes for GPTJ
#3256
Merged

Fix missing scale attributes for GPTJ #3256

jeffra merged 7 commits into master from cholmes/gptj-weight-scale-fix
cmikeh2
cmikeh2 Fix missing scale attributes
565995af
cmikeh2 cmikeh2 requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
cmikeh2 cmikeh2 requested a review from jeffra jeffra 2 years ago
cmikeh2 cmikeh2 requested a review from mrwyattii mrwyattii 2 years ago
cmikeh2 cmikeh2 requested a review from awan-10 awan-10 2 years ago
cmikeh2 cmikeh2 requested a review from arashb arashb 2 years ago
cmikeh2 Fix double usage of scratch space
a4884888
cmikeh2 Merge branch 'master' into cholmes/gptj-weight-scale-fix
49f9d5bc
tjruwase Merge branch 'master' into cholmes/gptj-weight-scale-fix
4c2b6c12
mrwyattii
mrwyattii approved these changes on 2023-04-20
mrwyattii Merge branch 'master' into cholmes/gptj-weight-scale-fix
0fd4a3fb
mrwyattii mrwyattii enabled auto-merge (squash) 2 years ago
mrwyattii Merge branch 'master' into cholmes/gptj-weight-scale-fix
4cc8faac
disabled auto-merge 2 years ago
Manually disabled by user
mrwyattii Merge branch 'master' into cholmes/gptj-weight-scale-fix
829496ff
mrwyattii mrwyattii enabled auto-merge (squash) 2 years ago
disabled auto-merge 2 years ago
Manually disabled by user
jeffra
jeffra approved these changes on 2023-04-21
jeffra jeffra merged 145c3a75 into master 2 years ago
jeffra jeffra deleted the cholmes/gptj-weight-scale-fix branch 2 years ago
conglongli conglongli added deepspeed-chat
conglongli conglongli removed deepspeed-chat
heroes999

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone