DeepSpeed
[inference] ds-attention refactor w.r.t. ops
#2623
Merged

[inference] ds-attention refactor w.r.t. ops #2623

jeffra merged 17 commits into master from jeffra/op-shim
jeffra
jeffra refactor of ds-attn to use new op bindings
4c6a2574
jeffra Merge branch 'master' into jeffra/op-shim
4436042c
jeffra fix imports
dd6088f6
jeffra fix for softmax
8c1f1aab
jeffra fix typo
e7610c43
jeffra jeffra marked this pull request as ready for review 3 years ago
jeffra jeffra requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 3 years ago
jeffra jeffra requested a review from mrwyattii mrwyattii 3 years ago
jeffra jeffra requested a review from awan-10 awan-10 3 years ago
jeffra jeffra requested a review from cmikeh2 cmikeh2 3 years ago
jeffra jeffra requested a review from arashb arashb 3 years ago
jeffra jeffra changed the title Inference backend refactor [inference] ds-attention refactor w.r.t. ops 3 years ago
cmikeh2
cmikeh2 commented on 2022-12-19
cmikeh2
cmikeh2 commented on 2022-12-19
jeffra Merge branch 'master' into jeffra/op-shim
3be21698
cmikeh2
cmikeh2 commented on 2022-12-19
jeffra address comments and consolidate imports
cabf95a7
jeffra Merge branch 'jeffra/op-shim' of github.com:microsoft/DeepSpeed into …
f413336c
jeffra fix import issue
804e1b94
jeffra move bloom specific attn into its own class
7a845df9
cmikeh2
cmikeh2 approved these changes on 2022-12-21
jeffra remove dead code
ba1ffafc
jeffra Merge branch 'master' into jeffra/op-shim
7eec8848
jeffra Merge branch 'master' into jeffra/op-shim
7c0c1dcf
jeffra Merge branch 'master' into jeffra/op-shim
f722064e
jeffra Merge branch 'master' into jeffra/op-shim
ba0690c2
jeffra move softmax op to BloomSelfAttention
3b239335
jeffra only load op in base to avoid excessive logging
3b99b579
jeffra jeffra merged bb68c526 into master 3 years ago
jeffra jeffra deleted the jeffra/op-shim branch 3 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone