llama.cpp
tests : add test-tokenizer-0.sh
#7036
Merged

ggerganov merged 15 commits into master from gg/add-tokenizer-test-script
ggerganov tests : add test-tokenizer-0.sh
ce7d3a04
ggerganov force pushed to ce7d3a04 1 year ago
ggerganov unicode : add all unicode number ranges
7053b261
ggerganov starcoder : fix pre-tokenizer
cf00fe1e
ggerganov tests : add test that fails with DeepSeek tokenizers
3a461dbf
ggerganov falcon : fix regex
3275e60f
ggerganov added high priority
ggerganov unicode : regenerate unicode tables
cd7c728a
ggerganov refact : add tokenizer model
d53240cc
ggerganov lint : fix
c30056a7
ggerganov tests : disable failing tests
bc26eb75
ggerganov refact : add tests files
9745cf88
ggerganov Merge branch 'master' into gg/add-tokenizer-test-script
26f606ef
ggerganov convert : print -> logging
d974aed5
ggerganov lint : fix
5f30e30a
ggerganov force pushed to 5f30e30a 1 year ago
ggerganov unicode : digit -> number
f19b45cb
ggerganov phi-3 : update
7e11d409
ggerganov merged 92139b90 into master 1 year ago
ggerganov deleted the gg/add-tokenizer-test-script branch 1 year ago

Reviewers: No reviews
Assignees: No one assigned