server : implement prompt processing progress report in stream mode #15827
server : implement `return_progress`
29f1d50e
add timings.cache_n
4404ad86
add progress.time_ms
e166a550
add test
f4213ccd
fix test for chat/completions
ebcef910
ngxson
marked this pull request as ready for review 103 days ago
readme: add docs on timings
b6ac24c6
ggerganov
approved these changes
on 2025-09-06
use ggml_time_us
053dc6b3
ngxson
merged
61bdfd52
into master 103 days ago
ngxson
deleted the xsn/server_progress_api branch 74 days ago
Assignees
No one assigned
Labels
examples
python
server
Login to write a write a comment.
Login via GitHub