transformers
bdf5fb70 - Skip non-selected experts for qwen3_moe (#38133)

Commit
194 days ago
Skip non-selected experts for qwen3_moe (#38133) * fix(qwen3moe): skip experts with no workload * avoid tolist and also update other moe models * fix: should squeeze 0-dim only
Author
Parents
Loading