llama.cpp
sampling: reuse token data buffer in llama_sampler_sample
#18365
Merged

JayZenith
JayZenith requested a review from ggerganov 64 days ago
JayZenith closed this 64 days ago
JayZenith reopened this 64 days ago
ggerganov approved these changes on 2025-12-29
JayZenith force pushed from 7ab07509 to f40f43a7 60 days ago
JayZenith force pushed from f40f43a7 to 517bb736 60 days ago
JayZenith force pushed from 517bb736 to 515d8dc7 60 days ago
JayZenith sampling: reuse token data buffer in llama_sampler_sample
e19fb5f1
JayZenith move cur buffer before timing section, after samplers
61c2d17e
JayZenith force pushed from 515d8dc7 to 61c2d17e 60 days ago
ggerganov minor : fix build
08331ddf
ggerganov merged c32fa21d into master 59 days ago
