llama.cpp
57c1e056 - llama: offload output layer to GPU first (#18148)

Commit
173 days ago
llama: offload output layer to GPU first (#18148)
Parents
Loading