vllm
[Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync
#37139
Merged

Loading