llama.cpp
55b62bce
- llama : reuse device buffers when possible
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
53 days ago
llama : reuse device buffers when possible
References
#22838 - spec : parallel drafting support
Author
ggerganov
Parents
f1652197
Loading