ggml : allow CUDA graphs when using pipeline parallelism #13814
ggml : allow CUDA graphs when using pipeline parallelism
c17627c8
ggerganov
approved these changes
on 2025-05-27
slaren
merged
952f3953
into master 1 year ago
slaren
deleted the sl/fix-cuda-graphs-pp branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub