llama.cpp
[CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
#19042
Merged

[CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full #19042

gaugarg-nv
gaugarg-nv [CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
d3298dc3
gaugarg-nv Set the env variable in the CUDA backend registry allocation
29c73efe
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
gaugarg-nv Add link to PR in code comment
14de97eb
JohannesGaessler
JohannesGaessler commented on 2026-01-24
ggerganov
am17an
gaugarg-nv Remove warning logs and update documentation
ed2e4840
gaugarg-nv
am17an
gaugarg-nv
am17an
github-actions github-actions added documentation
JohannesGaessler
JohannesGaessler approved these changes on 2026-01-26
ggerganov
ggerganov approved these changes on 2026-01-26
gaugarg-nv
ggerganov ggerganov merged a83c73a1 into master 73 days ago
ggerganov

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone