vllm
f9e6db30 - [Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync (#37139)

Commit
44 days ago
[Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync (#37139) Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Author
Parents
Loading