DeepSpeed
Add more synchronizations and barriers for the multi-gpu inference case
#1309
Merged

RezaYazdaniAminabadi merged 6 commits into master from reyazda/mp_inference
RezaYazdaniAminabadi
add more synchronizations and barriers for resolving gpu-halt issue
045d19d9
removing unuseful broadcasts
5038b077
RezaYazdaniAminabadi requested a review from awan-10, cli99, conglongli, eltonzheng, jeffra, minjiaz, niumanar, samyam, ShadenSmith, tjruwase 4 years ago
RezaYazdaniAminabadi Merge branch 'master' into reyazda/mp_inference
77451e8e
hyunwoongko commented on 2021-08-21
RezaYazdaniAminabadi Merge branch 'master' into reyazda/mp_inference
a116250f
RezaYazdaniAminabadi Merge branch 'master' into reyazda/mp_inference
a9c76ba2
RezaYazdaniAminabadi Merge branch 'master' into reyazda/mp_inference
7c629a40
jeffra approved these changes on 2021-08-27
RezaYazdaniAminabadi merged 0ec11daa into master 4 years ago
mrwyattii deleted the reyazda/mp_inference branch 2 years ago
