llama.cpp
llama : refactor llama_context, llama_kv_cache, llm_build_context (v2)
#12181
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
16
Changes
View On
GitHub
llama : refactor llama_context, llama_kv_cache, llm_build_context (v2)
#12181
ggerganov
merged 16 commits into
master
from
gg/llama-kv-cache-v2
github-actions
added
android
github-actions
added
examples
github-actions
added
python
github-actions
added
server
ggerganov
force pushed
from
1bdfacc9
to
900f2faa
187 days ago
ggerganov
force pushed
from
564747be
to
5bb8a26c
187 days ago
ggerganov
force pushed
from
48632552
to
250f398b
187 days ago
ggerganov
force pushed
from
25a58486
to
72a46666
186 days ago
ggerganov
force pushed
from
72a46666
to
905164fb
186 days ago
ggerganov
force pushed
from
f85d0b32
to
766edbf0
185 days ago
ggerganov
force pushed
from
766edbf0
to
62ba774b
185 days ago
ggerganov
marked this pull request as ready for review
185 days ago
ggerganov
requested a review
from
ngxson
185 days ago
slaren
approved these changes on 2025-03-11
ggerganov
force pushed
from
62ba774b
to
a170669c
181 days ago
llama : refactor llama_context, llama_kv_cache, llm_build_context
55909257
graph : don't mutate the KV cache during defrag
75624a20
context : reduce virtuals + remove test function
5aa3518d
context : move interface implementation to source file + factory
0a6648ca
graph : move KV cache build functions to llama_context impl
cc9fa25a
graph : remove model reference from build_pooling
29c9ef56
graph : remove llama_model reference
bc825604
kv_cache : provide rope factors
ff95ffdf
graph : rework inputs to use only unique_ptr, remove attn input abstr…
562a4787
context : remove llama_context_i abstraction
d0cb3196
context : clean-up
a4fc4e8e
graph : clean-up
af9f6b8e
llama : remove redundant keywords (struct, enum)
226ff010
model : adapt gemma3
5fc6dbd9
ggerganov
force pushed
from
a170669c
to
5fc6dbd9
180 days ago
graph : restore same attention ops as on master
70ef6530
llama : remove TODO + fix indent
31b8eab5
ggerganov
merged
e0dbec0b
into master
179 days ago
ggerganov
deleted the gg/llama-kv-cache-v2 branch
179 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
slaren
ngxson
Assignees
No one assigned
Labels
android
examples
python
server
Milestone
No milestone
Login to write a write a comment.
Login via GitHub