feat: improve qwen2-vl startup #2802
drbh
force pushed
from
066addd7
to
60b9c187
1 year ago
drbh
force pushed
from
60b9c187
to
3cc82978
1 year ago
drbh
force pushed
from
32a95640
to
a3049f10
1 year ago
drbh
force pushed
from
a3049f10
to
d671f6e0
1 year ago
drbh
changed the title feat: tokenize each request individually and increase warmup image size feat: improve qwen2-vl startup 1 year ago
drbh
force pushed
from
d671f6e0
to
320b520d
1 year ago
feat: tokenize each request individually and increase warmup image size
1bcfba30
feat: adjust rotary embed and avoid cuda graphs of size 2 and smaller
45e5c2c2
fix: address image resize and rebase changes
822bd045
feat: update to run qwen2-vl tests
bd59f961
drbh
force pushed
from
35b528e8
to
bd59f961
1 year ago
fix: tweak param types
37f92f2c
drbh
merged
eecca271
into main 1 year ago
drbh
deleted the improve-qwen2-vl-warmup branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub