llama.cpp
57c1e056
- llama: offload output layer to GPU first (#18148)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 day ago
llama: offload output layer to GPU first (#18148)
References
#18148 - llama: offload output layer to GPU first
Author
JohannesGaessler
Parents
9cff4cc5
Loading