llama.cpp
server: benchmark: chat/completions scenario and other llm servers comparison
#5941
Merged

server: benchmark: chat/completions scenario and other llm servers comparison #5941

phymbert merged 15 commits into master from hp/server/bench/init
phymbert
phymbert server: bench: Init a bench scenario with K6
68d1d8fe
phymbert phymbert changed the title server: bench: Init a bench scenario with K6 server: bench: scenario with K6 1 year ago
phymbert server: bench: EOL EOF
0b822b6a
phymbert phymbert marked this pull request as ready for review 1 year ago
phymbert phymbert requested a review from ggerganov ggerganov 1 year ago
phymbert phymbert requested a review from ngxson ngxson 1 year ago
phymbert
phymbert phymbert changed the title server: bench: scenario with K6 server: benchmark: chat/completions scenario and other llm servers comparison 1 year ago
ngxson
ngxson requested changes on 2024-03-08
ngxson
phymbert
ggerganov
phymbert server: bench: PR feedback and improved k6 script configuration
548bc963
phymbert
phymbert server: bench: remove llamacpp_completions_tokens_seconds as it inclu…
ab0a59d6
phymbert server: bench: fix doc
f425240e
phymbert server: bench: change gauge custom metrics to trend
bed1cdda
phymbert server: bench: change gauge custom metrics to trend
572758a6
phymbert
phymbert server: bench: doc add an option to debug http request
06e225f8
phymbert server: bench: filter dataset too short and too long sequences
a4b0d107
phymbert server: bench: allow to filter out conversation in the dataset based …
29c635b4
phymbert server: bench: fix assistant message sent instead of user message
ba7114c0
phymbert server: bench: fix assistant message sent instead of user message
c4d1b5aa
ggerganov Merge branch 'master' into hp/server/bench/init
5d25f748
ggerganov server : add defrag thold parameter
52c76d57
ggerganov
phymbert
phymbert
ggerganov
phymbert
ggerganov
ggerganov
ggerganov approved these changes on 2024-03-09
phymbert
ggerganov
ngxson
ngxson approved these changes on 2024-03-09
phymbert
phymbert server: bench: select prompts based on the current iteration id not r…
6bfb80eb
phymbert phymbert merged 621e86b3 into master 1 year ago
phymbert phymbert deleted the hp/server/bench/init branch 1 year ago
phymbert
phymbert
ngxson
ngxson
phymbert

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone