transformers
[Qwen3-next] Fix dimension mismatch in torch_chunk_gated_delta_rule and torch_recurrent_gated_delta_rule (#40963)
#41036
Merged

notkisk
Rocketknight1
vasqu
vasqu commented on 2025-09-22
vasqu
github-actions
notkisk force pushed from 3e2394f2 to ad7b43d2 96 days ago
vasqu
vasqu approved these changes on 2025-09-22
notkisk force pushed from db58944b to 14bc2524 95 days ago
notkisk
notkisk requested a review from vasqu 95 days ago
notkisk fix mismatched dims for qwen3 next
198f82da
notkisk propagate changes
1962634a
notkisk chore: renamed tot_heads to total_sequence_length
7515843b
notkisk Apply suggestion from @vasqu
f3ab6fd8
notkisk minor fix to modular qwen3 next file
6e89ac06
notkisk force pushed from 9be36c92 to 6e89ac06 94 days ago
github-actions
vasqu
github-actions
vasqu enabled auto-merge (squash) 94 days ago
vasqu
vasqu merged 80f20e0f into main 94 days ago
HuggingFaceDocBuilderDev
