llama.cpp
Prompt lookup decoding #4484 (Merged)
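Prompt lookup decoding drafts candidate tokens by searching the tokens already in the context for an earlier occurrence of the most recent n-gram and proposing the tokens that followed it; the target model then verifies the draft as in speculative decoding, with no extra draft model needed. A minimal sketch of that matching step (function and parameter names are hypothetical, not llama.cpp's actual implementation):

```python
def find_draft(tokens, max_ngram_size=3, n_draft=10):
    """Search the context for an earlier occurrence of the most recent
    n-gram and return the tokens that followed it as a draft.

    Tries the longest n-gram first, falling back to shorter ones."""
    for n in range(max_ngram_size, 0, -1):
        if len(tokens) < n + 1:
            continue
        tail = tokens[-n:]  # the most recent n tokens
        # scan earlier positions, most recent match first,
        # excluding the tail itself
        for i in range(len(tokens) - n - 1, -1, -1):
            if tokens[i:i + n] == tail:
                start = i + n
                return tokens[start:start + n_draft]
    return []  # no match: fall back to normal decoding

# toy token ids: the context ends with id 10, which also occurred earlier,
# so the tokens that followed that earlier occurrence become the draft
ctx = [10, 20, 30, 40, 10]
print(find_draft(ctx, max_ngram_size=2, n_draft=3))  # → [20, 30, 40]
```

An accepted draft lets the target model score several tokens in one batch; when the continuation diverges, decoding simply resumes token by token.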
LeonEricsson: initial commit, going through initializations (cae8f50b)
LeonEricsson: main loop finished, starting to debug (0ec5fdb5)
LeonEricsson: BUG: generates gibberish/repeating tokens after a while (1665ad8b)
LeonEricsson: Merge branch 'ggerganov:master' into prompt-lookup (34048416)
LeonEricsson: kv_cache management (21431197)
LeonEricsson: Merge branch 'prompt-lookup' of github.com:LeonEricsson/llama.cpp int… (45b8032b)
LeonEricsson: Added colors to distinguish drafted tokens (--color). Updated README (1b26d715)
ggerganov: lookup : fix token positions in the draft batch (5b279754)
ggerganov approved these changes on 2023-12-17
ggerganov: lookup : use n_draft from CLI params (d8ed670c)
ggml-org ggml-org deleted a comment from ghost on 2023-12-20
ggerganov: lookup : final touches (50ea1ef7)
ggerganov merged 7082d24c into master 1 year ago
