llama.cpp
fix : lookup word in vocab before doing BPE merges
#7193
Merged

fix : lookup word in vocab before doing BPE merges #7193

tonyfettes
tonyfettes tonyfettes marked this pull request as draft 1 year ago
tonyfettes tonyfettes changed the title Llama3 tokenizer ignore merge fix : lookup word in vocab before doing BPE merges 1 year ago
tonyfettes tonyfettes marked this pull request as ready for review 1 year ago
mofosyne mofosyne added Review Complexity : Medium
mofosyne mofosyne added bugfix
mofosyne mofosyne requested a review from goerch goerch 1 year ago
ggerganov
ggerganov commented on 2024-05-10
tonyfettes tonyfettes force pushed from 4ba2e5c9 to 63207d17 1 year ago
ggerganov
ggerganov
tonyfettes
ggerganov
ggerganov approved these changes on 2024-05-10
tonyfettes fix: llama-3 ignore_merges
c21d5e13
tonyfettes test: add test for llama-3 bpe ignore_merges
c7614930
tonyfettes fix: set ignore_merges only for llama-3
8a51d3b1
tonyfettes fix: test-tokenizer-1-bpe --ingore-merges detection
5d30a6dd
tonyfettes fix: copy to fix fallthrough
1fb5b558
tonyfettes fix: change ignore_merges to bool
c3d0f41d
tonyfettes fix: add ignore merges tests to cmake
0c9a0aef
tonyfettes tonyfettes force pushed from 0f48f9e0 to 0c9a0aef 1 year ago
ggerganov
ggerganov commented on 2024-05-11
ggerganov llama : alternative merge ignore logic
b8d3cd53
ggerganov ggerganov merged f99e1e45 into master 1 year ago
github-actions

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone