vllm
ee3cf457 - [XPU] Initial support for GDN attention on Qwen3-next/Qwen3.5 (#33657)

Commit
19 days ago
[XPU] Initial support for GDN attention on Qwen3-next/Qwen3.5 (#33657) Signed-off-by: Yan Ma <yan.ma@intel.com> Signed-off-by: Chendi Xue <chendi.xue@intel.com> Co-authored-by: Chendi Xue <chendi.xue@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
Author
Parents
Loading