DeepSpeed
GPT-J inference support
#1670
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
6
Changes
View On
GitHub
GPT-J inference support
#1670
jeffra
merged 6 commits into
master
from
gptj-inference-support
fix the top1gating logic when use_rts is set to false
7a459056
Modify inference-api and add some kernels to support GPT-J inference
713397f5
fix kernels header
0a4bfa63
fixing some issues in kernels and API
7418b729
fix half version of rotary_pos_emb kernel
8a99292b
RezaYazdaniAminabadi
marked this pull request as ready for review
3 years ago
RezaYazdaniAminabadi
requested a review
from
awan-10
3 years ago
RezaYazdaniAminabadi
requested a review
from
cli99
3 years ago
RezaYazdaniAminabadi
requested a review
from
conglongli
3 years ago
RezaYazdaniAminabadi
requested a review
from
eltonzheng
3 years ago
RezaYazdaniAminabadi
requested a review
from
jeffra
3 years ago
RezaYazdaniAminabadi
requested a review
from
minjiaz
3 years ago
RezaYazdaniAminabadi
requested a review
from
samyam
3 years ago
RezaYazdaniAminabadi
requested a review
from
ShadenSmith
3 years ago
RezaYazdaniAminabadi
requested a review
from
tjruwase
3 years ago
Merge branch 'master' into gptj-inference-support
97e5ab2c
jeffra
enabled auto-merge (squash)
3 years ago
jeffra
approved these changes on 2022-01-08
jeffra
merged
289c3f9b
into master
3 years ago
mrwyattii
deleted the gptj-inference-support branch
2 years ago
Login to write a write a comment.
Login via GitHub
Reviewers
jeffra
awan-10
cli99
conglongli
eltonzheng
minjiaz
samyam
ShadenSmith
tjruwase
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub