llama.cpp
Add support for SmallThinker model series
#14898
Merged

Add support for SmallThinker model series #14898

CISC merged 18 commits into ggml-org:master from wdl339:smallthinker
wdl339
wdl339 support smallthinker
efe27eb5
wdl339 support 20b softmax, 4b no sliding window
a6d6eafe
wdl339 Merge branch 'master' into smallthinker
a5274b79
wdl339 new build_moe_ffn_from_probs, and can run 4b
8e2cb21f
wdl339 fix 4b rope bug
e28d2c56
wdl339 Merge branch 'master' into smallthinker
ebd78ccc
wdl339 fix python type check
8c6af02a
github-actions github-actions added python
wdl339 remove is_moe judge
92b518b4
ggerganov
ggerganov commented on 2025-07-27
wdl339 remove set_dense_start_swa_pattern function and modify set_swa_patter…
4186babc
wdl339 trim trailing whitespace
f1d4698f
wdl339 wdl339 marked this pull request as ready for review 145 days ago
CISC
CISC requested changes on 2025-07-27
wdl339 remove get_vocab_base of SmallThinkerModel in convert_hf_to_gguf.py
f10cd467
wdl339 better whitespace
4af8b591
wdl339 use GGML_ASSERT for expert count validation
e2c900ce
wdl339 Improve null pointer check for probs
594af993
wdl339 use template parameter for SWA attention logic
29e1fe0a
ggerganov
ggerganov commented on 2025-07-28
wdl339 better whitespace
5d09d11b
wdl339 move the creation of inp_out_ids before the layer loop
bb3dd583
wdl339 remove redundant judge for probs
e338c30c
CISC
CISC approved these changes on 2025-07-28
CISC CISC merged 6c6e397a into master 144 days ago
arch-btw
CISC
wdl339
arch-btw
wdl339
CISC
TechnotechGit
arch-btw
arch-btw
wdl339
wdl339
TechnotechGit
arch-btw
wdl339
wdl339
arch-btw
CISC
arch-btw
CISC
arch-btw
wdl339
arch-btw

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone