DeepSpeed
Fix sparse attention for small block-sizes
#1545
Merged

Fix sparse attention for small block-sizes #1545

jeffra merged 9 commits into master from fix-sparse-attn
RezaYazdaniAminabadi
fixing the softmax masking when using triangular masking
f7ef4b5e
Merge branch 'master' of github.com:microsoft/DeepSpeed
dfb603fe
Merge branch 'master' of github.com:microsoft/DeepSpeed
c5ecf325
Merge branch 'master' of github.com:microsoft/DeepSpeed
426ecf73
Merge branch 'master' of github.com:microsoft/DeepSpeed
fde63105
fixing the sparse attention for low block-size
542b25c4
remove attn
d926bf75
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from awan-10 awan-10 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from cli99 cli99 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from conglongli conglongli 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from eltonzheng eltonzheng 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from jeffra jeffra 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from minjiaz minjiaz 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from niumanar niumanar 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from samyam samyam 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from ShadenSmith ShadenSmith 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from tjruwase tjruwase 4 years ago
RezaYazdaniAminabadi Merge branch 'master' into fix-sparse-attn
544c0bc3
jeffra Merge branch 'master' into fix-sparse-attn
abfe900e
jeffra
jeffra approved these changes on 2021-11-12
jeffra jeffra enabled auto-merge (squash) 4 years ago
jeffra jeffra merged 3ed77304 into master 4 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone