vllm
a8eab8f3
- [Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Next and Qwen3.5 (#37975)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
36 days ago
[Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Next and Qwen3.5 (#37975) Signed-off-by: wxsIcey <1790571317@qq.com> Signed-off-by: Icey <1790571317@qq.com>
References
#37975 - [Model] Extract GatedDeltaNetAttention into shared layer for Qwen3Next and Qwen3.5
Author
wxsIcey
Parents
2babac0b
Loading