llama.cpp
llama : cache llama_token_to_piece
#7587
Merged

llama : cache llama_token_to_piece #7587

mofosyne merged 4 commits into master from gg/cache-token-to-piece
ggerganov
github-actions
ggerganov ggerganov force pushed from 92b88a04 to 3e5d281c 1 year ago
mofosyne mofosyne added Review Complexity : Low
skoulik
ggerganov ggerganov requested a review from ochafik ochafik 1 year ago
ggerganov ggerganov requested a review from HanClinto HanClinto 1 year ago
ggerganov llama : cache llama_token_to_piece
9964cd02
ggerganov ggerganov force pushed from 3e5d281c to 9964cd02 1 year ago
ggerganov ggerganov added merge ready
HanClinto
HanClinto commented on 2024-05-29
HanClinto
HanClinto commented on 2024-05-29
HanClinto
HanClinto commented on 2024-05-29
ggerganov llama : use vectors and avoid has_cache
21ccd645
ggerganov llama : throw on unknown tokenizer types
1494a184
HanClinto
HanClinto commented on 2024-05-29
HanClinto
HanClinto commented on 2024-05-29
HanClinto
HanClinto commented on 2024-05-29
HanClinto
ggerganov llama : print a log of the total cache size
8a8f8b95
ggerganov ggerganov force pushed from 5069b93b to 8a8f8b95 1 year ago
HanClinto
HanClinto approved these changes on 2024-05-29
HanClinto
slaren
ggerganov
ochafik
ochafik approved these changes on 2024-05-30
ochafik
mofosyne mofosyne merged 5921b8f0 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone