vllm
9c3fe993 - Flashinfer cuDNN backend for Qwen3 VL ViT attention (#34580)

Commit
55 days ago
Flashinfer cuDNN backend for Qwen3 VL ViT attention (#34580) Signed-off-by: Max Hu <maxhu@nvidia.com> Signed-off-by: Max Hu <hyoung2991@gmail.com> Co-authored-by: Max Hu <maxhu@nvidia.com> Co-authored-by: Shang Wang <shangw@nvidia.com>
Author
Parents
Loading