llama.cpp
llama : disable graph reuse with pipeline parallelism
#20463
Merged

ggerganov
ggerganov llama : disable graph reuse with pipeline parallelism
dfa3ad18
ggerganov force-pushed from 7bc73aef to dfa3ad18 59 days ago
ggerganov Revert "CUDA: Improve performance via less synchronizations between …
14be4a46
github-actions added the Nvidia GPU label
github-actions added the ggml label
JohannesGaessler approved these changes on 2026-03-12
ORippler approved these changes on 2026-03-12
ggerganov merged 57819b8d into master 59 days ago
ggerganov deleted the gg/llama-disable-graph-reuse-with-pp branch 59 days ago
Superbobo75
aendk
