mlp_only_layers is more flexible than decoder_sparse_step #30552
force back to commit ba40a21 and fix workflow errors
a40396b2
eigen2017
force pushed
to
a40396b2
1 year ago
match the review suggestions
abcf2e06
fix ci errors
5de12160
fix CI
0bf30308
fix ci, format code
6ad31eef
fix ci, ruff format
ff3f8ffd
fix ci, ruff format again
df936692
Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
aa3a1045
Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
cce6eae2
Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
6a6e4da0
solve this warning: Default Argument Value is mutable
fec56e61
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub