YJHMITWEB
changed the title Fixing a severe reshape bug in sequence parallel alltoall Fixing the reshape bug in sequence parallel alltoall, which caused the model unable to converge2 years ago
YJHMITWEB
changed the title Fixing the reshape bug in sequence parallel alltoall, which caused the model unable to converge Fixing the reshape bug in sequence parallel alltoall, which caused the training unable to converge2 years ago
YJHMITWEB
changed the title Fixing the reshape bug in sequence parallel alltoall, which caused the training unable to converge Fixing the reshape bug in sequence parallel alltoall, which corrupted all QKV data2 years ago
Login to write a write a comment.
Login via GitHub