transformers
Fix load balancing loss func for mixtral
#28256
Merged

liangxuZhang
kalomaze
ArthurZucker
ArthurZucker commented on 2024-01-02
codybum
theblackcat102
ArthurZucker
liangxuZhang
ArthurZucker
ArthurZucker
ArthurZucker approved these changes on 2024-01-10
liangxuZhang Correct the implementation of auxiliary loss of mixtrtal
64ec7cd1
liangxuZhang correct the implementation of auxiliary loss of mixtrtal
fc6b8b09
liangxuZhang Implement a simpler calculation method
0fe02443
liangxuZhang force pushed to 0fe02443 2 years ago
liangxuZhang
bratao
ArthurZucker
ArthurZucker merged e768616a into main 2 years ago
ArthurZucker
dancingpipi
cryoco