llama.cpp
Add Unigram tokenizer needed by T5 and FLAN-T5 model families
#8089
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
5
Changes
View On
GitHub
Add Unigram tokenizer needed by T5 and FLAN-T5 model families
#8089
fairydreaming
merged 5 commits into
ggml-org:master
from
fairydreaming:t5-clean-2
llama : add T5 model architecture, tensors and model header parameters
c2c799ce
mofosyne
added
Review Complexity : Medium
llama : add handling of byte tokens in UGM tokenizer (same as in SPM)
f4c03c09
ggerganov
approved these changes on 2024-06-25
llama : replace allocated precompiled_charsmap buffer with std::vecto…
87b7dd23
Merge branch 'ggerganov:master' into t5-clean-2
21d36842
llama : fix whitespace formatting
f23ff913
fairydreaming
merged
6fcbf682
into master
1 year ago
ggerganov
commented on 2024-07-02
Login to write a write a comment.
Login via GitHub
Reviewers
ggerganov
Assignees
No one assigned
Labels
Review Complexity : Medium
Milestone
No milestone
Login to write a write a comment.
Login via GitHub