llama.cpp
[CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
#19042
Merged

Commits
  • [CUDA] Reduce CPU-side stalls due to the CUDA command buffer being full
    gaugarg-nv committed 146 days ago
  • Set the env variable in the CUDA backend registry allocation
    gaugarg-nv committed 146 days ago
  • Add link to PR in code comment
    gaugarg-nv committed 146 days ago
  • Remove warning logs and update documentation
    gaugarg-nv committed 143 days ago
Loading