vllm
86fc2321 - [Metrics] Add bucket for `request_latency`, `time_to_first_token` and `time_per_output_token` (#15202)

Commit
1 year ago
[Metrics] Add bucket for `request_latency`, `time_to_first_token` and `time_per_output_token` (#15202)

Signed-off-by: Kay Yan <kay.yan@daocloud.io>