llama.cpp
speculative : add tree-based sampling example
#3624
Merged

ggerganov merged 18 commits into master from speculative-tree
ggerganov
ggerganov sampling : one sequence per sampling context
5261aee8
ggerganov speculative : add tree-based sampling support
4de5a2d4
ggerganov changed the title from "speculative : add tree-based sampling support" to "speculative : add tree-based sampling example" 2 years ago
KerfuffleV2
ggerganov
ggerganov speculative : reuse the n_parallel CLI param
32a67cbd
ggerganov speculative : refactor sampling
4a7f43f2
ggerganov examples : fix build after sampling refactoring
7e48e21b
ggerganov force pushed to 7e48e21b 2 years ago
ggerganov batched : fix n_seq_id
0d96efab
ggerganov sampling : fix malloc
b5554b9e
ggerganov force pushed to b5554b9e 2 years ago
ggerganov swift : fix build
b8acb6c9
ggerganov force pushed to b8acb6c9 2 years ago
ggerganov swift : try to fix build
5b34bfa2
ggerganov prompts : add assistant.txt
00594910
ggerganov common : add llama_batch_add() and llama_batch_clear() helpers
360a3331
ggerganov speculative : minor refactor
1c626e2f
ggerganov marked this pull request as ready for review 2 years ago
ggerganov added the refactoring label
ggerganov added the need feedback label
ggerganov
ggerganov minor : comments + rename
373d782d
ggerganov speculative : fix off-by-one for n_drafted
f07cd35d
KerfuffleV2
ggerganov speculative : fix the n_drafted fix + p constants
e6dd81f0
ggerganov
ggerganov Merge branch 'master' into speculative-tree
010c52ec
ggerganov Merge branch 'master' into speculative-tree
bd9451ca
ggerganov Merge branch 'master' into speculative-tree
ad2727d0
ggerganov merged 0e89203b into master 2 years ago
KerfuffleV2
ggerganov
KerfuffleV2
ggerganov
jukofyork
jukofyork
ggerganov

Reviewers: No reviews
Assignees: No one assigned