llama.cpp
spec : refactor ctx
#22787
Closed
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
24
Changes
View On
GitHub
spec : refactor ctx
#22787
ggerganov
wants to merge 24 commits into
master
from
gg/spec-refactor-ctx
github-actions
added
examples
github-actions
added
server
ggerganov
force pushed
from
fbb14d55
to
1d5b0fee
44 days ago
ggerganov
force pushed
from
1d5b0fee
to
d719d8aa
44 days ago
github-actions
added
ggml
github-actions
added
Apple Metal
ggerganov
marked this pull request as ready for review
44 days ago
ggerganov
requested a review
44 days ago
ggerganov
requested a review
44 days ago
ggerganov
requested a review
44 days ago
ggerganov
requested a review
from
am17an
44 days ago
spec : refactor
2c9a4084
spec : drop support for incompatible vocabs
befc7ef6
spec : update common_speculative_init()
4550f0f0
cont : pass seq_id
77269ad8
cont : dedup ctx_seq_rm_type
8a50f6f0
server : sketch the ctx_dft decode loop
c97dc360
server : draft prompt cache and checkpoints
11fd5e72
server : improve ctx names
1afee5b2
server, spec : transition to unified spec context
de35b125
cont : sync main and drft contexts
08c8012b
cont : async drft eval when possible
c7facb0f
cont : handle non-ckpt models
0239f4c6
cont : pass correct n_past for drafting
ae6703fa
cont : process images throught the draft context
7e118cdc
ggerganov
force pushed
from
21e83adc
to
7e118cdc
44 days ago
am17an
commented on 2026-05-08
spec : handle draft running out of context
8be14e40
server : fix mtmd draft processing
6a4b05a0
server : fix URL for draft model
12c7cfbe
github-actions
added
python
server : add comment
233d1aee
server : clean-up + dry
3b1a8df8
speculative-simple : update
e5b14013
am17an
commented on 2026-05-08
am17an
commented on 2026-05-08
spec : fix n_past type
161eae0a
server : fix slot ctx_drft ptr
1dbc054d
tools : update readme
778f9e24
ggerganov
requested a review
from
ngxson
43 days ago
am17an
commented on 2026-05-08
naming : improve consistency
efa2f8e5
ggerganov
commented on 2026-05-08
ruixiang63
commented on 2026-05-08
ggerganov
closed this
40 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
am17an
pwilkin
ruixiang63
ngxson
Assignees
No one assigned
Labels
examples
python
server
ggml
Apple Metal
Milestone
No milestone
Login to write a write a comment.
Login via GitHub