[Misc][MoE] add Deepseek-V3 moe tuning support #12558
Deepseek-V3 moe tuning workaround
85e1a086
rm bypass graph logic. No longer reqd
edda3cb5
divakar-amd
changed the title from "Deepseek-V3 moe tuning workaround" to "[Misc][MoE] add Deepseek-V3 moe tuning support" 1 year ago
divakar-amd
marked this pull request as ready for review 1 year ago
mgoin
approved these changes on 2025-01-29
mgoin
enabled auto-merge (squash) 1 year ago
mgoin
merged commit 1c1bb0bb into main 1 year ago