llama.cpp
speculative : fix batch sizes at initialization
#9963
Merged

speculative : fix batch sizes at initialization #9963

ggerganov merged 3 commits into master from gg/speculative-fixes
ggerganov
ggerganov speculative : fix batch sizes at initialization
47bb241c
github-actions github-actions added examples
ggerganov ggerganov requested a review from slaren slaren 1 year ago
slaren
slaren approved these changes on 2024-10-20
slaren
ggerganov speculative : handle params.n_predict == -1
67d18498
ggerganov speculative : limit batch size to llama_n_batch
90ab8a10
ggerganov
ggerganov ggerganov merged bc219750 into master 1 year ago
ggerganov ggerganov deleted the gg/speculative-fixes branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone