DeepSpeed
[CPU] Faster reduce kernel for SHM allreduce
#4049
Merged

[CPU] Faster reduce kernel for SHM allreduce #4049

delock
delock faster allreduce with omp parallel for reduce kernel
38537c42
delock delock requested a review from jeffra jeffra 2 years ago
delock delock requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
delock delock requested a review from cmikeh2 cmikeh2 2 years ago
delock delock requested a review from awan-10 awan-10 2 years ago
delock delock requested a review from arashb arashb 2 years ago
tjruwase
tjruwase approved these changes on 2023-07-26
mrwyattii mrwyattii merged 7f26bb6a into master 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone