llama.cpp
server : don't overfill the batch during infill
#10018
Merged

server : don't overfill the batch during infill #10018

ggerganov merged 1 commit into master from gg/infill-6
ggerganov
github-actions github-actions added examples
github-actions github-actions added server
ggerganov server : don't overfill the batch during infill
48d5a1f8
ggerganov ggerganov force pushed from b051dc9e to 48d5a1f8 1 year ago
ggerganov ggerganov merged 8125e6cb into master 1 year ago
ggerganov ggerganov deleted the gg/infill-6 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone