DeepSpeed
Add support of OPT models
#2205
Merged

Add support of OPT models #2205

tjruwase merged 17 commits into master from arashb/opt
arashb
arashb add opt replace policy
17edaa5d
arashb simplify inf. api
3543b833
arashb fix opt replace policy
418f3764
fix use-cash & add relu
64ab1d5c
arashb Add support of custom MLP act. function
03f76199
arashb Revert "simplify inf. api"
f8aef938
arashb fix the inference API (temp. solution)
e7fecac3
arashb arashb force pushed from 545bb422 to e7fecac3 3 years ago
arashb fix code formatting
b8e94d5a
arashb add unit tests for OPT models.
27cabdc7
arashb refactor pre-attention layer norm configuration
2f97d471
arashb add support of opt-350m model
976fbd40
arashb arashb marked this pull request as ready for review 3 years ago
arashb arashb requested a review from jeffra jeffra 3 years ago
arashb arashb requested a review from samyam samyam 3 years ago
arashb arashb requested a review from tjruwase tjruwase 3 years ago
arashb arashb requested a review from ShadenSmith ShadenSmith 3 years ago
arashb arashb requested a review from conglongli conglongli 3 years ago
arashb arashb requested a review from awan-10 awan-10 3 years ago
arashb arashb requested a review from cli99 cli99 3 years ago
arashb arashb requested a review from eltonzheng eltonzheng 3 years ago
arashb arashb requested a review from minjiaz minjiaz 3 years ago
arashb arashb requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 3 years ago
arashb arashb requested a review from duli2012 duli2012 3 years ago
arashb arashb requested a review from mrwyattii mrwyattii 3 years ago
arashb arashb requested a review from yaozhewei yaozhewei 3 years ago
arashb arashb requested a review from xiaoxiawu-microsoft xiaoxiawu-microsoft 3 years ago
arashb arashb requested a review from samadejacobs samadejacobs 3 years ago
arashb arashb changed the title Add support of OPT Models Add support of OPT models 3 years ago
jeffra Merge branch 'master' into arashb/opt
50f29a0c
arashb refactor the HF model config initialization
35952e5b
arashb fix hf model config issue
8e1cd3db
jeffra
jeffra approved these changes on 2022-08-12
jeffra Merge branch 'master' into arashb/opt
138dec72
RezaYazdaniAminabadi
RezaYazdaniAminabadi approved these changes on 2022-08-12
arashb Merge branch 'master' into arashb/opt
d2876e96
RezaYazdaniAminabadi Merge branch 'master' into arashb/opt
282bae4c
tjruwase tjruwase merged 8b2a6371 into master 3 years ago
mrwyattii mrwyattii deleted the arashb/opt branch 2 years ago

Login to write a write a comment.

Login via GitHub