transformers
Add support for MiniMax-M2
#42028
Merged

Add support for MiniMax-M2 #42028

vasqu merged 46 commits into huggingface:main from rogeryoungh:minimax-m2
rogeryoungh
update: init m2
53639168
update: docs and config
261fe5cc
update: init minimax-m2 test
ac4613ca
update: fix tests
3421fe74
update: use partial_rotary_factor
3a5df7a9
update: some fix
cb17f622
fix: import Unpack from processing_utils
f6775d80
molbap
molbap approved these changes on 2025-11-05
rogeryoungh update: apply suggestions from code review
73904ee7
update: remove MiniMaxM2DecoderLayer and MiniMaxM2MLP
6b7e3973
molbap molbap requested a review from Cyrilvallez Cyrilvallez 219 days ago
molbap
molbap commented on 2025-11-12
update: remove use_qk_norm
15657210
update: remove unused use_qk_norm
fa093014
molbap
molbap molbap requested a review from ArthurZucker ArthurZucker 218 days ago
Qubitium
vasqu
vasqu commented on 2025-11-19
vasqu
update: update config and attention
11fdb585
update: add to tokenization_auto and remove unused test
93b7598d
rogeryoungh Merge branch 'main' into minimax-m2
5de5179d
update: fix decoder layer and experts
f5219ca7
update: fix docs
ef9a1f98
rogeryoungh rogeryoungh force pushed from ae28a324 to ef9a1f98 207 days ago
update: make ci happy
a0eea925
rogeryoungh
MekkCyber
vasqu
vasqu commented on 2025-11-24
refactor: use mapping
2dbfc3b8
update: remove unused comments
640fb9f1
rogeryoungh Merge branch 'main' into minimax-m2
8fe4d2bd
rogeryoungh
vasqu
rogeryoungh
molbap
vasqu
MekkCyber
vasqu
vasqu commented on 2025-11-27
rogeryoungh Merge branch 'main' into minimax-m2
17c68c83
update: fix rope_params and router
8ba23a6e
rogeryoungh
MekkCyber
vasqu
vasqu commented on 2025-12-05
update: remove rope_theta
8a76b781
update: test_load_balancing_loss
bcc0aa12
rogeryoungh Merge branch 'main' into minimax-m2
5a82172b
update: docs
47c84e2f
rogeryoungh
vasqu
vasqu commented on 2025-12-08
vasqu
update: fix default theta
fdb6807e
rogeryoungh
ArthurZucker
vasqu update to proper default values, proper config rope, simplified modular
1f07a00a
vasqu fix docs
25a8b5c4
vasqu modular fixup
7ea9a2b3
vasqu Merge branch 'main' into minimax-m2
13ed0be9
vasqu
ArthurZucker
ArthurZucker approved these changes on 2025-12-11
vasqu review comments
9eebcedf
vasqu Merge branch 'main' into minimax-m2
668dbef2
vasqu update slow tests
90acf7cc
vasqu
vasqu style
e4463e1b
github-actions
github-actions
vasqu fp32 strict
fde9591a
vasqu
github-actions
vasqu revert the flag
7b6824f6
huggingface huggingface deleted a comment from github-actions on 2025-12-11
vasqu
vasqu Merge branch 'main' into minimax-m2
dda58d9c
vasqu Merge branch 'main' into minimax-m2
04c0b5f3
vasqu sync with latest changes
87032353
vasqu fixup buffer init
53c19ac8
vasqu add cache exception to minimax m2 as we have a naming clash
28f26fd2
vasqu
github-actions
github-actions
vasqu fix dtype issue in gate
cca9ee72
vasqu
rogeryoungh
vasqu
vasqu vasqu closed this 162 days ago
vasqu vasqu reopened this 162 days ago
github-actions
vasqu Merge branch 'main' into minimax-m2
3d6d6718
github-actions
github-actions
vasqu lift fp8 test restriction and apply new linter rules
b867712c
github-actions
vasqu
github-actions
github-actions
github-actions
vasqu update docs
252d02d3
github-actions
vasqu
ArthurZucker
ArthurZucker approved these changes on 2026-01-09
vasqu vasqu enabled auto-merge (squash) 161 days ago
vasqu vasqu merged 7a2bf25f into main 161 days ago
HuggingFaceDocBuilderDev
vasqu vasqu added New model

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone