llama : disable graph reuse with pipeline parallelism #20463
llama : disable graph reuse with pipeline parallelism
dfa3ad18
ggerganov force pushed from 7bc73aef to dfa3ad18 59 days ago
Revert "CUDA: Improve performance via less synchronizations between …
14be4a46
ORippler approved these changes on 2026-03-12
ggerganov merged 57819b8d into master 59 days ago
ggerganov deleted the gg/llama-disable-graph-reuse-with-pp branch 59 days ago