DeepSpeed
GPT-J inference support
#1670
Merged

jeffra merged 6 commits into master from gptj-inference-support
RezaYazdaniAminabadi
fix the top1gating logic when use_rts is set to false
7a459056
Modify inference-api and add some kernels to support GPT-J inference
713397f5
fix kernels header
0a4bfa63
fixing some issues in kernels and API
7418b729
fix half version of rotary_pos_emb kernel
8a99292b
RezaYazdaniAminabadi marked this pull request as ready for review 3 years ago
RezaYazdaniAminabadi requested a review from awan-10 3 years ago
RezaYazdaniAminabadi requested a review from cli99 3 years ago
RezaYazdaniAminabadi requested a review from conglongli 3 years ago
RezaYazdaniAminabadi requested a review from eltonzheng 3 years ago
RezaYazdaniAminabadi requested a review from jeffra 3 years ago
RezaYazdaniAminabadi requested a review from minjiaz 3 years ago
RezaYazdaniAminabadi requested a review from samyam 3 years ago
RezaYazdaniAminabadi requested a review from ShadenSmith 3 years ago
RezaYazdaniAminabadi requested a review from tjruwase 3 years ago
jeffra Merge branch 'master' into gptj-inference-support
97e5ab2c
jeffra enabled auto-merge (squash) 3 years ago
jeffra approved these changes on 2022-01-08
jeffra merged 289c3f9b into master 3 years ago
mrwyattii deleted the gptj-inference-support branch 2 years ago