llama.cpp
spec : refactor ctx
#22787
Closed

spec : refactor ctx #22787

ggerganov wants to merge 24 commits into master from gg/spec-refactor-ctx
ggerganov
github-actions github-actions added examples
github-actions github-actions added server
ggerganov ggerganov force pushed from fbb14d55 to 1d5b0fee 44 days ago
ggerganov ggerganov force pushed from 1d5b0fee to d719d8aa 44 days ago
github-actions github-actions added ggml
github-actions github-actions added Apple Metal
ggerganov ggerganov marked this pull request as ready for review 44 days ago
ggerganov ggerganov requested a review 44 days ago
ggerganov ggerganov requested a review 44 days ago
ggerganov ggerganov requested a review 44 days ago
ggerganov ggerganov requested a review from am17an am17an 44 days ago
ggerganov spec : refactor
2c9a4084
ggerganov spec : drop support for incompatible vocabs
befc7ef6
ggerganov spec : update common_speculative_init()
4550f0f0
ggerganov cont : pass seq_id
77269ad8
ggerganov cont : dedup ctx_seq_rm_type
8a50f6f0
ggerganov server : sketch the ctx_dft decode loop
c97dc360
ggerganov server : draft prompt cache and checkpoints
11fd5e72
ggerganov server : improve ctx names
1afee5b2
ggerganov server, spec : transition to unified spec context
de35b125
ggerganov cont : sync main and drft contexts
08c8012b
ggerganov cont : async drft eval when possible
c7facb0f
ggerganov cont : handle non-ckpt models
0239f4c6
ggerganov cont : pass correct n_past for drafting
ae6703fa
ggerganov cont : process images throught the draft context
7e118cdc
ggerganov ggerganov force pushed from 21e83adc to 7e118cdc 44 days ago
am17an
am17an commented on 2026-05-08
ggerganov spec : handle draft running out of context
8be14e40
ggerganov server : fix mtmd draft processing
6a4b05a0
ggerganov server : fix URL for draft model
12c7cfbe
github-actions github-actions added python
am17an
ggerganov
ggerganov server : add comment
233d1aee
pwilkin
ggerganov server : clean-up + dry
3b1a8df8
ggerganov speculative-simple : update
e5b14013
ggerganov
am17an
am17an commented on 2026-05-08
am17an
am17an commented on 2026-05-08
ggerganov spec : fix n_past type
161eae0a
ggerganov server : fix slot ctx_drft ptr
1dbc054d
ggerganov tools : update readme
778f9e24
ggerganov ggerganov requested a review from ngxson ngxson 43 days ago
am17an
am17an
am17an commented on 2026-05-08
ggerganov naming : improve consistency
efa2f8e5
ggerganov
ggerganov
ggerganov commented on 2026-05-08
ruixiang63
ggerganov
ruixiang63
ruixiang63 commented on 2026-05-08
ggerganov
ggerganov ggerganov closed this 40 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone