llama.cpp
57c1e056 - llama: offload output layer to GPU first (#18148)

Commit
140 days ago
llama: offload output layer to GPU first (#18148)
Parents
Loading