Optimize Qwen2VL vision model by precomputing cos/sin embeds before ViT blocks #35837
li-plus
force pushed
from
2e912f3f
to
343e47ed
1 year ago
li-plus
force pushed
from
343e47ed
to
64c28270
1 year ago
li-plus
force pushed
from
64c28270
to
8c220896
1 year ago
Optimize Qwen2VL vision model by precomputing cos/sin embeds before V…
2357261a
Make rotary_pos_emb optional & fix type
984c2ffb
Adapt pre-computed cos/sin to Qwen2.5VL
79907108
li-plus
force pushed
from
8c220896
to
79907108
1 year ago
More concise
65b620da
li-plus
force pushed
from
40d625d9
to
65b620da
1 year ago
li-plus
deleted the fast-qwen2vl-vision-rope branch 1 year ago
Assignees
No one assigned
Labels
optimization
Multimodal
Login to write a write a comment.
Login via GitHub