llama.cpp
sampling : optimize samplers by reusing bucket sort
#15665
Merged

sampling : optimize samplers by reusing bucket sort #15665

ggerganov merged 9 commits into master from gg/sampling-sort-opt
ggerganov
github-actions github-actions added testing
JohannesGaessler
JohannesGaessler approved these changes on 2025-08-29
ggerganov ggerganov requested a review from ngxson ngxson 146 days ago
github-actions github-actions added examples
github-actions github-actions added server
ggerganov
ggerganov commented on 2025-08-30
ggerganov ggerganov force pushed to 2efc7e46 146 days ago
ggerganov ggerganov requested a review from slaren slaren 146 days ago
JohannesGaessler
JohannesGaessler commented on 2025-08-30
slaren
slaren approved these changes on 2025-08-30
ggerganov sampling : optimize sorting using bucket sort in more places
7d34a4bf
ggerganov sampling : do not sort in dist sampler
97167e61
ggerganov sampling : avoid heap allocations for sort buffers
15557b8f
ggerganov common : add option to sort sampling candidates by probability
70bce496
ggerganov sampling : revert the change for preserving sort buffers
c8a2adaf
ggerganov sampling : use std::copy instead of memcpy
de2902dc
ggerganov sampling : clarify purpose of partial sort helpers
6d2a38c7
ggerganov ggerganov force pushed from d74a6ab4 to 6d2a38c7 145 days ago
ggerganov
ggerganov cont : remove wrong comment [no ci]
08d5ff4f
ggerganov common : update comment
1136efb2
ggerganov ggerganov merged e92d53b2 into master 145 days ago
ggerganov ggerganov deleted the gg/sampling-sort-opt branch 145 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone