transformers
mlp_only_layers is more flexible than decoder_sparse_step
#30552
Merged

ArthurZucker merged 11 commits into huggingface:main from eigen2017:main
eigen2017 force back to commit ba40a21 and fix workflow errors
a40396b2
eigen2017 force pushed to a40396b2 1 year ago
amyeroberts commented on 2024-05-01
eigen2017 match the review suggestions
abcf2e06
eigen2017 fix ci errors
5de12160
eigen2017 fix CI
0bf30308
eigen2017 fix ci, format code
6ad31eef
eigen2017 fix ci, ruff format
ff3f8ffd
eigen2017 fix ci, ruff format again
df936692
ArthurZucker approved these changes on 2024-05-07
eigen2017 Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
aa3a1045
eigen2017 Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
cce6eae2
ArthurZucker commented on 2024-05-08
eigen2017 Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
6a6e4da0
eigen2017 solve this warning: Default Argument Value is mutable
fec56e61
ArthurZucker approved these changes on 2024-05-10
ArthurZucker merged 1c52cb7b into main 1 year ago
