speculative : fix batch sizes at initialization #9963
speculative : fix batch sizes at initialization
47bb241c
slaren
approved these changes
on 2024-10-20
speculative : handle params.n_predict == -1
67d18498
speculative : limit batch size to llama_n_batch
90ab8a10
ggerganov
merged
bc219750
into master 1 year ago
ggerganov
deleted the gg/speculative-fixes branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub