llama.cpp
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output)
#13784
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
17
Changes
View On
GitHub
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output)
#13784
ngxson
merged 17 commits into
ggml-org:master
from
ngxson:xsn/qwen25omni
mtmd : allow multiple modalities at the same time
58c7849f
github-actions
added
examples
github-actions
added
python
refactor mtmd tokenizer
2782a583
fix compile
bb92d1d0
ok, missing SinusoidsPositionEmbedding
8b51e7fa
first working version
24ec43eb
fix style
1ac73f40
more strict validate of n_embd
90132453
refactor if..else to switch
346d2522
fix regression
6e65e0c5
add test for 3B
235fbdbf
update docs
bf34f38f
ngxson
marked this pull request as ready for review
215 days ago
ngxson
requested a review
from
ggerganov
215 days ago
github-actions
added
documentation
fix tokenizing with add_special
d03c2407
add more tests
ef48e8f2
fix test case "huge"
94d893d6
ggerganov
approved these changes on 2025-05-27
Merge branch 'master' into xsn/qwen25omni
baa882a5
rm redundant code
05310968
set_position_mrope_1d rm n_tokens
27a8f266
ngxson
merged
bc583e3c
into master
213 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
ggerganov
Assignees
No one assigned
Labels
documentation
examples
python
Milestone
No milestone
Login to write a write a comment.
Login via GitHub