[Qwen3.5] Fix GDN linear attention multi-token cached forward #45513
Fix Qwen3.5 linear attention multi-token cached forward
49a614ca
Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
8c1317b6
vasqu
approved these changes
on 2026-04-23
vasqu
commented
on 2026-04-23
Review feedback: unify cached-forward state flag, gate single-token/c…
e395f5ac
kashif force pushed from 0f144648 to e395f5ac 24 days ago
Propagate linear-attention multi-token cached-forward fix to qwen3_ne…
29a76b36
kashif
changed the title [Qwen3.5] Fix Qwen3.5 linear attention multi-token cached forward [Qwen3.5] Fix GDN linear attention multi-token cached forward 24 days ago
Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
983aedf6
Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
23c2b84a
Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
8d4864bd
Merge branch 'main' into fix-qwen35-linear-attn-multi-token-cached
cec6c2fb
vasqu
approved these changes
on 2026-04-27
Review feedback: keep "prefill mode / multi-token decode" comment, la…
40f471b9
vasqu merged f53ca05d into main 20 days ago
Assignees
No one assigned