DeepSpeed
don't gather partitioned activations for mp size 1
#2454
Merged

guoyejun
guoyejun requested a review from jeffra 3 years ago
guoyejun requested a review from samyam 3 years ago
guoyejun requested a review from tjruwase 3 years ago
guoyejun requested a review from ShadenSmith 3 years ago
guoyejun requested a review from conglongli 3 years ago
guoyejun requested a review from awan-10 3 years ago
guoyejun requested a review from cli99 3 years ago
guoyejun requested a review from eltonzheng 3 years ago
guoyejun requested a review from minjiaz 3 years ago
guoyejun requested a review from RezaYazdaniAminabadi 3 years ago
guoyejun requested a review from duli2012 3 years ago
guoyejun requested a review from mrwyattii 3 years ago
guoyejun requested a review from yaozhewei 3 years ago
guoyejun requested a review from arashb 3 years ago
guoyejun requested a review from xiaoxiawu-microsoft 3 years ago
guoyejun requested a review from samadejacobs 3 years ago
guoyejun requested a review from cmikeh2 3 years ago
guoyejun requested a review from GuanhuaWang 3 years ago
guoyejun force pushed from 1bd43d81 to 705eabb2 3 years ago
guoyejun don't gather partitioned activations for mp size 1
95f0a4ac
guoyejun add inline comment for the change
982a0488
guoyejun force pushed from 8d1d8214 to 982a0488 3 years ago
tjruwase Merge branch 'master' into checkpointing
420c3bde
tjruwase approved these changes on 2022-11-04
tjruwase merged f74ee318 into master 3 years ago