DeepSpeed
fix MegatronLayerPolicy to be compatible with the newest ParallelTransformerLayer
#4236
Merged

dc3671
fix MegatronLayerPolicy to be compatible with the newest ParallelTran…
0a329ac4
dc3671 requested a review from RezaYazdaniAminabadi 2 years ago
dc3671 requested a review from jeffra 2 years ago
dc3671 requested a review from mrwyattii 2 years ago
dc3671 requested a review from awan-10 2 years ago
dc3671 requested a review from cmikeh2 2 years ago
dc3671 requested a review from arashb 2 years ago
RezaYazdaniAminabadi approved these changes on 2023-08-30
RezaYazdaniAminabadi Merge branch 'master' into fix-megatron-gpt
2d97cf03
tjruwase merged 6cbf6661 into master 2 years ago

