llama.cpp
Multi GPU support, CUDA refactor, CUDA scratch buffer
#1703
Merged
