DeepSpeed
Fix gpt-Neox rotary embedding implementation
#2782
Merged

jeffra merged 16 commits into master from fix-neox-rope
RezaYazdaniAminabadi
Reset KV-cache at the beginning of text-generation
8e91537e
Merge branch 'master' of github.com:microsoft/DeepSpeed
963cc9a5
Merge branch 'master' of github.com:microsoft/DeepSpeed
28be9bba
Merge branch 'master' of github.com:microsoft/DeepSpeed
ae91f324
fix the seq_id used for RoPE implementation
d48eb478
RezaYazdaniAminabadi requested a review from jeffra 2 years ago
RezaYazdaniAminabadi requested a review from mrwyattii 2 years ago
RezaYazdaniAminabadi requested a review from awan-10 2 years ago
RezaYazdaniAminabadi requested a review from cmikeh2 2 years ago
RezaYazdaniAminabadi requested a review from arashb 2 years ago
remove reset-cache
e877e1df
lekurile
lekurile approved these changes on 2023-02-02
fix formatting
e1c82659
RezaYazdaniAminabadi enabled auto-merge (squash) 2 years ago
RezaYazdaniAminabadi Merge branch 'master' into fix-neox-rope
0f4a0feb
RezaYazdaniAminabadi Merge branch 'master' into fix-neox-rope
352c5716
RezaYazdaniAminabadi Merge branch 'master' into fix-neox-rope
85dcdc96
tjruwase Merge branch 'master' into fix-neox-rope
2888414b
tjruwase Merge branch 'master' into fix-neox-rope
6199ca38
RezaYazdaniAminabadi Merge branch 'master' into fix-neox-rope
de1dd42d
tjruwase Merge branch 'master' into fix-neox-rope
d96115ac
tjruwase Merge branch 'master' into fix-neox-rope
8431e139
jeffra Merge branch 'master' into fix-neox-rope
4d237cd7
jeffra
jeffra approved these changes on 2023-02-16
disabled auto-merge 2 years ago
Manually disabled by user
jeffra merged 5b7413a4 into master 2 years ago
jeffra deleted the fix-neox-rope branch 2 years ago
