llama.cpp
`tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars
#12034
Merged

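The title's "trigger patterns for lazy grammars" can be pictured with a small sketch: output is left unconstrained until a trigger matches, and only the text from the match onward would be handed to the grammar. The snippet below is an illustrative Python approximation, not the C++ code from this PR. The helper names (`trigger_words_to_patterns`, `first_trigger`, `split_at_trigger`) and the example trigger strings are assumptions made for the example.

```python
import re


def trigger_words_to_patterns(words):
    """Turn literal trigger words into compiled regexes (hypothetical helper).

    re.escape ensures characters such as '.' or '{' are matched literally.
    """
    return [re.compile(re.escape(w)) for w in words]


def first_trigger(text, patterns):
    """Return (start_index, matched_text) for the earliest match, or None."""
    best = None
    for pattern in patterns:
        m = pattern.search(text)
        if m and (best is None or m.start() < best[0]):
            best = (m.start(), m.group(0))
    return best


def split_at_trigger(text, patterns):
    """Split text into (free-form prefix, part a grammar would constrain)."""
    hit = first_trigger(text, patterns)
    if hit is None:
        return text, ""  # no trigger seen: nothing is constrained
    start, _ = hit
    return text[:start], text[start:]


# Example triggers are assumptions for illustration only.
patterns = trigger_words_to_patterns(["<tool_call>", '{"name":'])
# A genuine pattern (not just an escaped word) can sit alongside them.
patterns.append(re.compile(r"```(?:json)?\s*\{"))

prefix, constrained = split_at_trigger("Sure. <tool_call>{...}", patterns)
print(repr(prefix), repr(constrained))
```

Escaping the literal words first means plain trigger words and real patterns can go through one matching path, which is roughly what the first commit title ("turn lazy grammar trigger words to regexes") suggests.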
ochafik merged 48 commits into ggml-org:master from ochafik:tool-bench-prod
ochafik
sampler: turn lazy grammar trigger words to regexes
b37779b9
add scripts/tool_bench.sh & .py
a4569111
optionally allow any spaces in json schema grammars (useful for llama…
14a43888
constrain llama json output regardless of function name if matches at…
e2ca8be6
better error when wrong function called
53266f9a
github-actions added script
github-actions added testing
github-actions added examples
github-actions added python
github-actions added server
GuuD
improve error message in weather test
7833c167
add more models to tool_bench.sh
0e1a00ec
ochafik
benchmark other sizes of qwen 2.5 coder
44740f7c
rm duplicate in tool_bench.sh
dd6eb97b
add missing <variant> include
0fc62182
fix lints
6fd4972a
improve "bad" qwen triggers
2e656f9f
add cast to please some gccs
fbd3c197
ditch server test request retry logic
62a1416a
fix flake8 lints
596ff7f3
nits
fe6968f3
remove any_spaces grammar option, allow extra line for airy llama jso…
1caacd5b
Update test_tool_call.py
789a3e1c
test w/ beefier qwen 2.5 coder 3b
6493a14b
revert some test_hello_world diffs
cc817a0a
diff
ead02c6d
Update test_tool_call.py
d7acf2c2
add requirements for tool_bench
0db4073e
fix test_thoughts deepseek test expectation
0ce606b9
Update README.md
a3cde169
update relaxed newline space rule in grammar tests
79ad6236
support add_generation_prompt query parameter (useful for /apply_temp…
3fe208a6
Merge remote-tracking branch 'origin/master' into tool-bench-prod
fe8c79b2
token cast tweak for gcc
99d2d802
fix warning on gcc13 w/ uninitialized variant
c7fa19ae
fix python lints
6e5a830f
ochafik marked this pull request as ready for review 296 days ago
ochafik requested a review from ngxson 296 days ago
fix gcc13 warning
0b5d1055
fix pyright lints in tool_bench.py
7bcc5af0
Merge remote-tracking branch 'origin/master' into tool-bench-prod
d1f48d03
update readme w/ link to tool call
fc19192f
tool-bench: add --ctk, --ctv, --fa flags
60f28ef6
Mushoz
ochafik
Mushoz
ggerganov
ggerganov approved these changes on 2025-03-03
ngxson
ngxson approved these changes on 2025-03-04
Merge remote-tracking branch 'origin/master' into tool-bench-prod
2470a1c1
common_grammar_trigger: always use string value (+ optional token)
e6e9c138
add llama_grammar_trigger_pattern
5d43b726
add common_grammar_trigger.{to_json,from_json}
1317a35f
fix crashing typo
ad3caa34
avoid returning optional from parse_json
a6d78873
disable slow hello Llama-3.1-8B (chopped unescaped string within strin…
20a2f5f8
ochafik commented on 2025-03-05
fix nit eol at eof
92e9723d
ochafik
Update src/llama-grammar.cpp
01be080e
ggerganov
Merge remote-tracking branch 'origin/master' into tool-bench-prod
00db4651
ngxson commented on 2025-03-05
ngxson
ochafik
avoid ggml_assert in server for grammar triggers inconsistency
24010fe7
add comment on limits to common_grammar_trigger.to/from json speciali…
71719a6e
ngxson
ngxson
ochafik
ngxson
ngxson approved these changes on 2025-03-05
ochafik merged 669912d9 into master 288 days ago
codefromthecrypt
