llama.cpp 15a68420 - cuda : enable CUDA graphs for MMID BS <= 4
Commit
4 hours ago
cuda : enable CUDA graphs for MMID BS <= 4
References
gg/cuda-graphs-enable-bs-gt1
#19645 - cuda : enable CUDA graphs for MMID 1 <= BS <= 4
Author
ggerganov
Parents
8bc255a3