transformers
[Qwen3.5] Fix GDN linear attention multi-token cached forward
#45513
Merged

kashif Fix Qwen3.5 linear attention multi-token cached forward
49a614ca
kashif requested a review from Cyrilvallez 28 days ago
kashif Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
8c1317b6
vasqu approved these changes on 2026-04-23
vasqu commented on 2026-04-23
kashif Review feedback: unify cached-forward state flag, gate single-token/c…
e395f5ac
kashif force pushed from 0f144648 to e395f5ac 24 days ago
kashif Propagate linear-attention multi-token cached-forward fix to qwen3_ne…
29a76b36
kashif changed the title from "[Qwen3.5] Fix Qwen3.5 linear attention multi-token cached forward" to "[Qwen3.5] Fix GDN linear attention multi-token cached forward" 24 days ago
kashif Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
983aedf6
kashif Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
23c2b84a
kashif Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
8d4864bd
kashif Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
cec6c2fb
vasqu approved these changes on 2026-04-27
kashif Review feedback: keep "prefill mode / multi-token decode" comment, la…
40f471b9
vasqu merged f53ca05d into main 20 days ago
