Add `DenseMoELayer` and wire it up in Mixtral/Deepseek V2 #2537
Add `DenseMoELayer` and wire it up in Mixtral/Deepseek V2
f10144cd
danieldk
force pushed
from
e9090abc
to
f10144cd
1 year ago
Narsil
approved these changes
on 2024-09-24
danieldk
merged
3f14cd14
into main 1 year ago
danieldk
deleted the maintenance/dense-moe-layer branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub