transformers
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gated_delta_rule attention computation: "attn = (q_i @ k_i.transpose(-1, -2) * decay_mask[:, :, i]).masked_fill_(mask, 0)"
#45215
Merged
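
A minimal sketch of why the `masked_fill_` in the title can be redundant. It assumes that `decay_mask` is already lower-triangular (zeros strictly above the diagonal) and that `mask` is the strictly upper-triangular causal mask. Neither definition appears on this page, so both are assumptions, and the tensors below are toy stand-ins rather than the model's real shapes.

```python
import torch

torch.manual_seed(0)
chunk_size, dim = 4, 8

# Hypothetical stand-ins for one chunk's query and key blocks.
q_i = torch.randn(chunk_size, dim)
k_i = torch.randn(chunk_size, dim)

# Assumption: the decay mask is lower-triangular, i.e. exactly zero
# strictly above the diagonal (the final .tril() enforces that here).
g = torch.randn(chunk_size).cumsum(0)
decay = (g[:, None] - g[None, :]).tril().exp().tril()

# Assumption: `mask` selects the strictly upper-triangular entries.
mask = torch.triu(
    torch.ones(chunk_size, chunk_size, dtype=torch.bool), diagonal=1
)

# Same expression shape as in the PR title, without and with masked_fill_.
attn_plain = q_i @ k_i.transpose(-1, -2) * decay
attn_masked = attn_plain.clone().masked_fill_(mask, 0)

# With finite inputs the two results are elementwise equal.
same = torch.equal(attn_plain, attn_masked)
print(same)
```

Under these assumptions the elementwise product already zeroes every entry that `mask` would overwrite, so the extra in-place fill changes nothing. The equivalence relies on the scores being finite: a non-finite value multiplied by zero gives NaN, which `masked_fill_` would have replaced with 0.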


Rocketknight1 merged 4 commits into huggingface:main from ENg-122:test_main
ENg-122
ENg-122 [Qwen3_5]Remove excess mask
b76f5957
ENg-122 Merge branch 'main' into test_main
061b7ee3
ENg-122 [Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gated_delta_r…
ce554fbd
github-actions
ENg-122 Fix: remove brackets to match generated code format
c8ace4dd
Rocketknight1
Rocketknight1 approved these changes on 2026-04-08
Rocketknight1 enabled auto-merge 76 days ago
HuggingFaceDocBuilderDev
Rocketknight1 merged 0aff0dbf into main 75 days ago
