llama.cpp
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output)
#13784
Merged

ngxson merged 17 commits into ggml-org:master from ngxson:xsn/qwen25omni
ngxson mtmd : allow multiple modalities at the same time
58c7849f
github-actions added the examples label
github-actions added the python label
ngxson refactor mtmd tokenizer
2782a583
ngxson fix compile
bb92d1d0
ngxson ok, missing SinusoidsPositionEmbedding
8b51e7fa
ngxson first working version
24ec43eb
ngxson fix style
1ac73f40
ngxson more strict validate of n_embd
90132453
oyaay
ngxson refactor if..else to switch
346d2522
ngxson fix regression
6e65e0c5
ngxson add test for 3B
235fbdbf
ngxson update docs
bf34f38f
ngxson marked this pull request as ready for review 215 days ago
ngxson requested a review from ggerganov 215 days ago
github-actions added the documentation label
mega-cqz
ngxson fix tokenizing with add_special
d03c2407
ngxson add more tests
ef48e8f2
ngxson fix test case "huge"
94d893d6
ggerganov approved these changes on 2025-05-27
ngxson Merge branch 'master' into xsn/qwen25omni
baa882a5
ngxson rm redundant code
05310968
ngxson set_position_mrope_1d rm n_tokens
27a8f266
ngxson merged bc583e3c into master 213 days ago
henfiber
pwilkin
nqchieutb01
alielmorsy
