YJHMITWEB
changed the title Fixing a severe reshape bug in sequence parallel alltoall Fixing the reshape bug in sequence parallel alltoall, which caused the model unable to converge1 year ago
YJHMITWEB
changed the title Fixing the reshape bug in sequence parallel alltoall, which caused the model unable to converge Fixing the reshape bug in sequence parallel alltoall, which caused the training unable to converge1 year ago
YJHMITWEB
changed the title Fixing the reshape bug in sequence parallel alltoall, which caused the training unable to converge Fixing the reshape bug in sequence parallel alltoall, which corrupted all QKV data1 year ago
Login to write a write a comment.
Login via GitHub