llama.cpp
backend : offload large batches to GPU
#6083
Merged

Loading