transformers
6acd5aec - Adding Qwen3 and Qwen3MoE (#36878)

Commit
258 days ago
Adding Qwen3 and Qwen3MoE (#36878) * Initial commit for Qwen3 * fix and add tests for qwen3 & qwen3_moe * rename models for tests. * fix * fix * fix and add docs. * fix model name in docs. * simplify modular and fix configuration issues * Fix the red CI: ruff was updated * revert ruff, version was wrong * fix qwen3moe. * fix * make sure MOE can load * fix copies --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Author
Parents
Loading