llama.cpp
`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false)
#13771
Merged

`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) #13771

ochafik merged 13 commits into ggml-org:master from ochafik:enable-thinking
ochafik
server: fix/test add_generation_prompt
8a25f796
tools: enable hermes2/qwen chat logic even w/o tools
43b5626e
server: add --reasoning-format=disabled to disable thinking (incl. qw…
b457f89e
github-actions github-actions added testing
github-actions github-actions added examples
github-actions github-actions added python
github-actions github-actions added server
Update README.md
df25e6bb
ochafik ochafik force pushed from 1b05d5ca to df25e6bb 139 days ago
Add models/templates/Qwen-Qwen3-0.6B.jinja
b6eb0a5c
ngxson
CISC
update --reasoning-format={disabled -> nothink} as suggested
cdea6a9b
ochafik
fix command r7b's nothink w/ official template
473c01e5
ochafik ochafik changed the title `server`: add `--reasoning-format=disabled` to disable thinking (incl. qwen3 w/ enable_thinking:false) `server`: add `--reasoning-format=nothink` to disable thinking (incl. qwen3 w/ enable_thinking:false) 139 days ago
ochafik ochafik marked this pull request as ready for review 139 days ago
ochafik ochafik requested a review from ngxson ngxson 139 days ago
ngxson
ngxson approved these changes on 2025-05-25
rewrite docs as list as suggested
6b9efe7b
ochafik Update common/chat.cpp
355b38c9
Merge branch 'enable-thinking' of github.com:ochafik/llama.cpp into e…
fe6022f4
const char* return for chat enum name helpers
8547fccf
ngxson
ngxson commented on 2025-05-25
switch to --reasoning-budget flag
9cdeebe8
Merge branch 'fix-gen-prompt' into enable-thinking
91623806
ngxson
ngxson approved these changes on 2025-05-25
ngxson ngxson changed the title `server`: add `--reasoning-format=nothink` to disable thinking (incl. qwen3 w/ enable_thinking:false) `server`: add `--reasoning-budget` to disable thinking (incl. qwen3 w/ enable_thinking:false) 138 days ago
ngxson ngxson changed the title `server`: add `--reasoning-budget` to disable thinking (incl. qwen3 w/ enable_thinking:false) `server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) 138 days ago
ochafik ochafik merged e121edc4 into master 138 days ago
countzero
kth8
countzero
characharm
jacekpoplawski
kth8
jacekpoplawski
jacekpoplawski

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone