llama.cpp
Ignore invalid UTF-8 input in the BPE tokenizer
#11729
Merged

Ignore invalid UTF-8 input in the BPE tokenizer #11729

cfillion
ggerganov
ggerganov approved these changes on 2025-02-07
slaren
slaren commented on 2025-02-07
cfillion ignore invalid UTF-8 input in the BPE tokenizer
cff1c3bc
cfillion cfillion force pushed from 6c70a3a9 to cff1c3bc 245 days ago
slaren
slaren approved these changes on 2025-02-07
ggerganov ggerganov merged 2d219b38 into master 245 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone