llama.cpp
57c1e056 - llama: offload output layer to GPU first (#18148)

Commit
1 day ago
llama: offload output layer to GPU first (#18148)
Parents
Loading