Add LinearAttention and CausalConvState ops for Qwen3.5 #27907
Add LinearAttention and CausalConvState CUDA and CPU kernel
d4f55055
Kernel improvements
797a71c1
Update the kernel based on new op spec
c629e832
Optimize Mlas path
c8b9d6d8
Cuda kernel improvements
589080b1
Potential fix for code scanning alert no. 34715: Module is imported w…
31543cdc
Fix lint
bf9ab720
Fix bug
245efd03
Fix for unit tests
cdf82155
Address copilot comments
f55606c8
Updates docs
6bbcde98
Merge branch 'main' into asonawane/linearattention
14d1d14e
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub