vllm
f9e6db30
- [Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync (#37139)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
44 days ago
[Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync (#37139) Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
References
#37139 - [Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync
Author
lgeiger
Parents
d61d2b08
Loading