vllm
9c3fe993
- Flashinfer cuDNN backend for Qwen3 VL ViT attention (#34580)
Commit
55 days ago
Flashinfer cuDNN backend for Qwen3 VL ViT attention (#34580)

Signed-off-by: Max Hu <maxhu@nvidia.com>
Signed-off-by: Max Hu <hyoung2991@gmail.com>
Co-authored-by: Max Hu <maxhu@nvidia.com>
Co-authored-by: Shang Wang <shangw@nvidia.com>
References
#34580 - Flashinfer cuDNN backend for Qwen3 VL ViT attention
Author
maxyanghu
Parents
b66a7464