llama.cpp
ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response
#6495
Merged

ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response #6495

phymbert merged 6 commits into master from hp/server/bench/sse
phymbert
phymbert ci: bench: support sse and fix prompt processing time
713fa986
phymbert phymbert requested a review from ggerganov ggerganov 2 years ago
phymbert phymbert requested a review from ngxson ngxson 2 years ago
phymbert ci: bench: README.md EOL
1534d903
phymbert ci: bench: remove total pp and tg as it is not accurate
36940266
phymbert ci: bench: fix case when there is no token generated
59dc4bbb
phymbert ci: bench: change to the 95 percentile for pp and tg as it is closer …
b6b50b11
phymbert ci: bench: fix finish reason rate
8789e17f
ngxson
ngxson approved these changes on 2024-04-04
phymbert phymbert merged 75cd4c77 into master 2 years ago
phymbert phymbert deleted the hp/server/bench/sse branch 2 years ago
ggerganov
phymbert
ggerganov
phymbert
ggerganov

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone