llama.cpp
57c1e056 - llama: offload output layer to GPU first (#18148)

Commit
35 days ago