llama.cpp
52e5f0a5 - common : re-arm reasoning budget after DONE on new <think> (#22323)

Commit
23 days ago
common : re-arm reasoning budget after DONE on new <think> (#22323) DONE state absorbs all tokens including a new start tag, causing any think blocks after the first to run unbudgeted. Observed on unsloth/Qwen3.6-27B-GGUF which interleaves multiple <think> blocks per response. Fixed by advancing start_matcher in DONE branch and re-arming to COUNTING with a fresh budget on match. Adds regression test (test-reasoning-budget: test 6).
Author
Parents
Loading