DeepSpeed
Increasing the token-length based on available memory for GPT models
#2280
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
14
Changes
View On
GitHub
Increasing the token-length based on available memory for GPT models
#2280
RezaYazdaniAminabadi
merged 14 commits into
cholmes/fix-long-seq-len-inference
from
ds-inference/support-large-token-length
increasing the token-length based on available memory & reduce memory…
6d7c133d
RezaYazdaniAminabadi
requested a review
from
jeffra
3 years ago
RezaYazdaniAminabadi
requested a review
from
samyam
3 years ago
RezaYazdaniAminabadi
requested a review
from
tjruwase
3 years ago
RezaYazdaniAminabadi
requested a review
from
ShadenSmith
3 years ago
RezaYazdaniAminabadi
requested a review
from
conglongli
3 years ago
RezaYazdaniAminabadi
requested a review
from
awan-10
3 years ago
RezaYazdaniAminabadi
requested a review
from
cli99
3 years ago
RezaYazdaniAminabadi
requested a review
from
eltonzheng
3 years ago
RezaYazdaniAminabadi
requested a review
from
minjiaz
3 years ago
RezaYazdaniAminabadi
requested a review
from
duli2012
3 years ago
RezaYazdaniAminabadi
requested a review
from
mrwyattii
3 years ago
RezaYazdaniAminabadi
requested a review
from
yaozhewei
3 years ago
RezaYazdaniAminabadi
requested a review
from
arashb
3 years ago
RezaYazdaniAminabadi
requested a review
from
xiaoxiawu-microsoft
3 years ago
RezaYazdaniAminabadi
requested a review
from
samadejacobs
3 years ago
RezaYazdaniAminabadi
requested a review
from
cmikeh2
3 years ago
Merge branch 'master' of github.com:microsoft/DeepSpeed into ds-infer…
216b9538
RezaYazdaniAminabadi
changed the base branch from
master
to
cholmes/fix-long-seq-len-inference
3 years ago
merging
b15f2410
Merge branch 'cholmes/fix-long-seq-len-inference' of github.com:micro…
a08a3bfd
formating
12a58142
fix compile issue
89baf13f
Merge branch 'cholmes/fix-long-seq-len-inference' into ds-inference/s…
50d9963e
RezaYazdaniAminabadi
requested a review
from
GuanhuaWang
3 years ago
Merge branch 'cholmes/fix-long-seq-len-inference' into ds-inference/s…
53ed3cc8
fix the max_out_tokens to use a dynamic range based on available memory
a149a5a4
fix the issue with empty prompt
c9b604df
Merge branch 'cholmes/fix-long-seq-len-inference' into ds-inference/s…
b018ec9c
fix residual-add
fb89f198
fix some issues with unit tests
56f80290
fix formatting
ac2698fd
RezaYazdaniAminabadi
merged
d8f52032
into cholmes/fix-long-seq-len-inference
3 years ago
Login to write a write a comment.
Login via GitHub
Reviewers
jeffra
samyam
tjruwase
ShadenSmith
conglongli
awan-10
cli99
eltonzheng
minjiaz
duli2012
mrwyattii
yaozhewei
arashb
xiaoxiawu-microsoft
samadejacobs
cmikeh2
GuanhuaWang
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub