Fix load balancing loss func for mixtral #28256
Correct the implementation of auxiliary loss of mixtrtal
64ec7cd1
correct the implementation of auxiliary loss of mixtrtal
fc6b8b09
Implement a simpler calculation method
0fe02443
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub