DeepSpeed
Fixing the reshape bug in sequence parallel alltoall, which corrupted all QKV data
#5664
Merged

Commits
  • fixing sequence parallel alltoall reshape bug
    Jinghan Yao committed 1 year ago
  • Merge branch 'master' into master
    loadams committed 1 year ago
Loading