llama.cpp
Add support for Qwen2VL
#10361
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
35
Changes
View On
GitHub
Add support for Qwen2VL
#10361
ggerganov
merged 35 commits into
ggml-org:master
from
HimariO:qwen2-vl
github-actions
added
build
github-actions
added
Nvidia GPU
github-actions
added
examples
github-actions
added
python
github-actions
added
ggml
Barebone Qwen2VL LLM convertor
c17546ff
Add Qwen2VL cli entrypoint
7c6f7934
[WIP] add qwen2vl arch
b24bd89e
Verify m-rope output
35411963
Add vl-rope/2d-rope support for qwen2vl ViT
9d389a05
update qwen2vl cli tool
f661483e
update 5D tensor op workaround
3c3691e1
[WIP] qwen2vl vision model
c13edfed
make batch and clip utils compatible with qwen2vl
7e9fc720
[WIP] create inference workflow, gguf convert script but fix
bcd49f59
correcting vision-rope behavior, add the missing last layer back to ViT
023f0076
add arg parser to qwen2vl_surgery
3d19dd44
replace variable size array with vector
53480d2b
cuda-gdb cmake preset
0882f576
add fp32 mrope, vision rope kernel
3237bb46
add fp16 support for qwen2vl and m-rope
201f7043
add `GGML_ROPE_TYPE_MROPE`, `GGML_ROPE_TYPE_VISION`
f1fa60f8
fix rope op mode switching, out dated func args
241bb457
update `llama_hparams`
07553cfb
update to keep up stream changes
fac03453
resolve linter, test errors
cbd08b42
HimariO
force pushed
to
cbd08b42
1 year ago
HimariO
marked this pull request as ready for review
1 year ago
ggerganov
commented on 2024-12-05
HimariO
force pushed
1 year ago
add makefile entry, update speical image padding token
6c39aa38
HimariO
force pushed
to
6c39aa38
1 year ago
add mrope unit test, fix few compiler warnings
ac2089c3
rename `mrope` related function, params
12f17f75
github-actions
added
testing
minor updates on debug util, bug fixs
3ba7664d
add `m-rope` testcase to `test-backend-ops`
b24ab863
Merge branch 'master' into qwen2-vl
d7edc550
ggerganov
approved these changes on 2024-12-12
Apply suggestions from code review
9abb2527
Merge branch 'ggerganov:master' into qwen2-vl
c292bf1d
fix traililng whitespce
e9748e4a
store `llama_hparams.rope_sections` with fixed size array
ef7f74bd
update position id tensor size check in GGML_OP_ROPE
e2e9a6ca
github-actions
added
Apple Metal
ggerganov
commented on 2024-12-13
minor updates
a02a1907
github-actions
added
Vulkan
github-actions
added
SYCL
github-actions
added
Kompute
HimariO
force pushed
1 year ago
ggerganov
commented on 2024-12-13
HimariO
force pushed
1 year ago
ggerganov
commented on 2024-12-13
HimariO
force pushed
1 year ago
update `ggml_backend_*_supports_op` of unsupported backends
19aba1db
HimariO
force pushed
to
19aba1db
1 year ago
remote old `rope_section` compare operator
f96909e2
ggerganov
merged
ba1cb19c
into master
1 year ago
Login to write a write a comment.
Login via GitHub
Reviewers
ggerganov
slaren
Assignees
No one assigned
Labels
build
testing
Nvidia GPU
Vulkan
examples
python
ggml
SYCL
Apple Metal
Kompute
Milestone
No milestone
Login to write a write a comment.
Login via GitHub