llama.cpp
[CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
#19042
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
Commits
[CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
gaugarg-nv
committed
146 days ago
Set the env variable in the CUDA backend registry allocation
gaugarg-nv
committed
146 days ago
Add link to PR in code comment
gaugarg-nv
committed
146 days ago
Remove warning logs and update documentation
gaugarg-nv
committed
143 days ago
Loading