llama.cpp
tokenizer : special token handling
#3538
Merged

tokenizer : special token handling #3538

ggerganov merged 11 commits into ggml-org:master from staviq:specialtokens
staviq
staviq Rewrite special token handling from #1931
b592c70d
ggerganov
ggerganov commented on 2023-10-08
ggerganov
ggerganov commented on 2023-10-08
ggerganov
ggerganov commented on 2023-10-10
staviq
staviq shorten param name, add st verification by type
fc634d87
apage43
goerch
jploski
staviq use offsets instead of copy by substr
29e6b46e
staviq Merge branch 'master' into specialtokens
eac5f544
cebtenzzre
cebtenzzre commented on 2023-10-11
staviq formatting, remove copying iterator on delete
f7b1205a
staviq
ggerganov
ggerganov Merge branch 'master' into HEAD
04ac0558
ggerganov llama : normalize code-style
5c6b2be1
ggerganov ggerganov marked this pull request as ready for review 1 year ago
ggerganov
ggerganov approved these changes on 2023-10-12
ggerganov ggerganov changed the title (wip) tokenizer: special token handling tokenizer : special token handling 1 year ago
ggerganov ggerganov added need feedback
staviq
staviq swift fix
0f1c5695
staviq print pfx/sfx if verb, main: split pfx input sfx
5974d617
staviq
staviq
staviq dont add space when using special tokens
1c28116d
staviq
jxy
ggerganov ggerganov requested a review from ggerganov ggerganov 1 year ago
ggerganov minor : comment + spacing
fc82541b
ggerganov ggerganov merged 1a159553 into master 1 year ago
ggerganov
halbtuerke
slaren
slaren commented on 2023-10-17
WolframRavenwolf
cebtenzzre
WolframRavenwolf
staviq
ggerganov
shibe2
staviq
shibe2
WolframRavenwolf
staviq
WolframRavenwolf
jploski
WolframRavenwolf
teleprint-me
shibe2
teknium1
WolframRavenwolf
jploski
WolframRavenwolf
ArthurZucker
ArthurZucker commented on 2023-10-19
cebtenzzre
staviq
shibe2
cebtenzzre
shibe2
ArthurZucker
shibe2

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone