vllm
[Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Next and Qwen3.5
#37975
Merged

jikunshang merged 12 commits into vllm-project:main from wxsIcey:refactor-gdn
wxsIcey
mergify added qwen
gemini-code-assist
wxsIcey
wxsIcey marked this pull request as ready for review 50 days ago
wxsIcey requested a review from sighingnow 50 days ago
wxsIcey requested a review from tdoublep 50 days ago
ZJY0516
vadiklyutiy
vadiklyutiy
gemini-code-assist
wxsIcey
ZJY0516
gemini-code-assist commented on 2026-03-24
wangxiyuan
ZJY0516
yma11 commented on 2026-03-24
wxsIcey
yma11
wxsIcey
jikunshang commented on 2026-03-24
yma11
mergify
mergify added needs-rebase
wxsIcey [Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Nex…
42b2d199
wxsIcey fix ruff
4ddb549e
wxsIcey fix qwen3.5 lora
d3a5f548
wxsIcey fix error
80450185
wxsIcey fix qwen3-next
48fd81c1
wxsIcey fix lora
03a44293
wxsIcey mini fix
4ee9c371
wxsIcey resolve conflict
c750f33a
wxsIcey force pushed to c750f33a 48 days ago
jikunshang added ready
mergify removed needs-rebase
mergify
jikunshang
wxsIcey fix mypy
0218d050
wxsIcey Merge branch 'main' into refactor-gdn
814e2aa9
wxsIcey
yma11 approved these changes on 2026-03-27
yma11
jikunshang
claude commented on 2026-03-27
wxsIcey remove unuse gdn_linear_attn
d043205d
wxsIcey remove unuse qkvz_output_size
25e94436
wxsIcey
jikunshang approved these changes on 2026-03-27
jikunshang merged a8eab8f3 into main 47 days ago
