DeepSpeed
Fix cpu-adam AVX performance
#1637
Merged

Fix cpu-adam AVX performance #1637

jeffra merged 11 commits into master from fix-CPUAdam-AVX
RezaYazdaniAminabadi
fixing the softmax masking when using triangular masking
f7ef4b5e
Merge branch 'master' of github.com:microsoft/DeepSpeed
dfb603fe
Merge branch 'master' of github.com:microsoft/DeepSpeed
c5ecf325
Merge branch 'master' of github.com:microsoft/DeepSpeed
426ecf73
Merge branch 'master' of github.com:microsoft/DeepSpeed
fde63105
Merge branch 'master' of github.com:microsoft/DeepSpeed
5bd5f2bb
Merge branch 'master' of github.com:microsoft/DeepSpeed
9691d5ef
Merge branch 'master' of github.com:microsoft/DeepSpeed
3460d201
Merge branch 'master' of github.com:microsoft/DeepSpeed
0af9e1c4
fixing the loop-unrolling for the avx operations
f40e2308
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from awan-10 awan-10 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from cli99 cli99 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from conglongli conglongli 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from eltonzheng eltonzheng 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from jeffra jeffra 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from minjiaz minjiaz 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from samyam samyam 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from ShadenSmith ShadenSmith 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from tjruwase tjruwase 4 years ago
jeffra
jeffra commented on 2021-12-15
jeffra
jeffra approved these changes on 2021-12-15
remove non-necessary changes
80b184e7
jeffra jeffra merged 259936a7 into master 4 years ago
jeffra jeffra deleted the fix-CPUAdam-AVX branch 4 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone