llama.cpp
vocab: add tokenizer support for jina-embeddings-v2-base-zh
#18756
Merged

vocab: add tokenizer support for jina-embeddings-v2-base-zh #18756

CISC merged 3 commits into ggml-org:master from o7si:issue-18452
o7si
o7si o7si marked this pull request as ready for review 157 days ago
o7si o7si requested a review from CISC CISC 157 days ago
o7si o7si requested a review from ggerganov ggerganov 157 days ago
github-actions github-actions added python
daveth3t3chg33k
daveth3t3chg33k commented on 2026-01-12
o7si
CISC
o7si o7si force pushed from b061faa9 to ccd55e4f 154 days ago
o7si o7si marked this pull request as draft 154 days ago
CISC
CISC commented on 2026-01-14
o7si
o7si
o7si
CISC
o7si
CISC
o7si vocab : add jina-embeddings-v2-base-zh (whitespace tokenizer)
0c1c9d33
o7si o7si force pushed from f3bce529 to 0c1c9d33 18 days ago
o7si
o7si o7si marked this pull request as ready for review 18 days ago
CISC
CISC approved these changes on 2026-05-29
CISC
CISC CISC added merge ready
CISC
CISC commented on 2026-05-29
CISC
CISC commented on 2026-05-30
CISC
CISC commented on 2026-05-30
CISC lowercase defaults to true
bbd3946c
CISC type fix
73ee68a1
ggerganov
ggerganov approved these changes on 2026-05-31
CISC
CISC commented on 2026-05-31
CISC CISC merged d4c8e2c2 into master 17 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone