llama.cpp
llama: (proposal) propagating the results of `graph_compute` to the user interface
#9525
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
7
Changes
View On
GitHub
Commits
llama: propagating the results of `graph_compute` to the user interface
Xarbirus
committed
1 year ago
llama: reverting kv_cache in case of failed compute
Xarbirus
committed
1 year ago
llama: `llama_kv_cache_state` was removed, only the result of `llama_graph_compute` is returned
Xarbirus
committed
1 year ago
llama: restore a kv_cache in case of failed computation
Xarbirus
committed
1 year ago
llama: correct reverting of the entire batch.
Xarbirus
committed
1 year ago
llama: updated comments
Xarbirus
committed
1 year ago
llama : add comments about KV cache state after error
ggerganov
committed
1 year ago
Loading