DeepSpeed
770967f5 - fixed: Modified the topkgating function and modified the test_moe file for testing (#7163)

Commit
108 days ago
fixed: Modified the topkgating function and modified the test_moe file for testing (#7163) Since the previous PR encountered the DCO problem and could not be solved for some reason, I resubmitted a completely identical PR but without the problem. --------- Signed-off-by: xiongjyu <xiongjyu@gmail.com> Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: Olatunji Ruwase <tjruwase@gmail.com> Co-authored-by: Masahiro Tanaka <81312776+tohtana@users.noreply.github.com>
Author
Parents
Loading