llama.cpp
Kimi Linear chunk size = 16
#19827
Merged

Kimi Linear chunk size = 16 #19827

ggerganov merged 26 commits into ggml-org:master from ymcki:dn
ymcki
ggerganov models : add llm_build_delta_net_base
fa44de74
ggerganov cont : keep qwen35 and qwen35moe graphs intact
403e78ea
ggerganov cont : add comments [no ci]
2371dfb5
ymcki add kimi linear to delta-net-base
cff8f601
ymcki sync to b8057
b0594c9f
ymcki removed unnecessary ggml_cont from g_exp_t
6c765ebf
ymcki removed ggml_cont from g_diff_exp_t. moved ggml_cont for o to kimi-li…
a93bcc4d
ymcki removed unnecessary diag mask
7b26805f
ggerganov cont : simplify
4a6393ef
ggerganov cont : avoid graph splits
c07977ae
ymcki scale q after mul instead of beginning
117763ed
ymcki Merge branch 'ggml-org:master' into dn
6dad4378
ymcki Merge branch 'master' of github.com:ymcki/llama.cpp into dn
df269dc7
ymcki scale q after mul instead of beginning
1cea24dc
ymcki scale q after mul instead of beginning
a6fa6d50
ymcki identical ppl
6432f95b
ggerganov cont : fix scale and decay mask
de6a8420
ggerganov minor : remove TODO
23cccea2
ymcki block implementation for kda
18d7b2ca
ymcki block implementation for kda
09f0baf7
ymcki ymcki requested a review from CISC CISC 42 days ago
github-actions github-actions added model
CISC
CISC commented on 2026-02-23
CISC CISC requested a review from ggerganov ggerganov 42 days ago
ymcki remove space at the end of line 101
ac46b38d
ymcki concat+pad
ec25a26c
ymcki
ymcki pad+binary row concat
8c96f826
ymcki
ymcki chunk size 16 for kda
aa5b8169
ymcki
ymcki ymcki changed the title Kimi Linear block implementation Kimi Linear chunk size = 16 35 days ago
ymcki Merge branch 'ggml-org:master' into dn
099645a7
pwilkin
ymcki
ggerganov
ggerganov commented on 2026-03-02
ymcki removed minor differences to master
f90c5850
ggerganov
ggerganov approved these changes on 2026-03-02
ggerganov ggerganov merged a0ed91a4 into master 32 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone