DeepSpeed
Fixing inference api for FP32 and non-masking GPT-based models
#1204
Merged

Fixing inference api for FP32 and non-masking GPT-based models #1204

jeffra merged 6 commits into master from reyazda/fix-inference-api
RezaYazdaniAminabadi
fixing inference api for FP32 and non-masking GPT-based models
db492f60
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from awan-10 awan-10 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from cli99 cli99 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from conglongli conglongli 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from eltonzheng eltonzheng 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from jeffra jeffra 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from minjiaz minjiaz 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from niumanar niumanar 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from samyam samyam 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from ShadenSmith ShadenSmith 4 years ago
RezaYazdaniAminabadi RezaYazdaniAminabadi requested a review from tjruwase tjruwase 4 years ago
use a dummy tensor if input_mask is none
e628299f
fix input_mask
485201bf
minor fix
65f1fe82
send input_mask to compute_attn func for checking
7cb65cc9
RezaYazdaniAminabadi Merge branch 'master' into reyazda/fix-inference-api
79ee82f6
jeffra
jeffra approved these changes on 2021-07-20
jeffra jeffra merged 6ba96289 into master 4 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone