DeepSpeed
6cbf6661 - fix MegatronLayerPolicy to be compatible with the newest ParallelTransformerLayer (#4236)

Commit
1 year ago
fix MegatronLayerPolicy to be compatible with the newest ParallelTransformerLayer (#4236)

Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Author
Dino Chen
Parents
  • deepspeed/module_inject/containers/megatron_gpt.py