vllm
86fc2321
- [Metrics] Add bucket for `request_latency`, `time_to_first_token` and `time_per_output_token` (#15202)
Commit
1 year ago
[Metrics] Add bucket for `request_latency`, `time_to_first_token` and `time_per_output_token` (#15202)

Signed-off-by: Kay Yan <kay.yan@daocloud.io>
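For readers unfamiliar with histogram buckets, the following is a minimal sketch of what adding a bucket changes, assuming the three metrics are Prometheus-style histograms with cumulative `le` (less-than-or-equal) upper bounds. The bucket bounds and latency values are hypothetical and chosen only for illustration; this page does not list the actual bounds changed by the commit, and `cumulative_bucket_counts` is an illustrative helper, not a vLLM function.

```python
from bisect import bisect_left


def cumulative_bucket_counts(observations, upper_bounds):
    """Return cumulative counts per `le` bound, plus the implicit +Inf bucket."""
    bounds = sorted(upper_bounds)
    per_bucket = [0] * (len(bounds) + 1)  # last slot is the +Inf bucket
    for value in observations:
        # bisect_left gives the first bound >= value, i.e. the smallest
        # bucket whose upper bound still contains the observation.
        per_bucket[bisect_left(bounds, value)] += 1

    # Histogram buckets are cumulative: each one also counts everything
    # that fell into the smaller buckets.
    cumulative, running = [], 0
    for count in per_bucket:
        running += count
        cumulative.append(running)

    labels = [str(b) for b in bounds] + ["+Inf"]
    return dict(zip(labels, cumulative))


# Hypothetical latencies in seconds and hypothetical bucket bounds.
latencies = [0.4, 2.5, 45.0, 90.0, 300.0]
before = cumulative_bucket_counts(latencies, [1.0, 10.0, 60.0])
after = cumulative_bucket_counts(latencies, [1.0, 10.0, 60.0, 120.0])
```

With the extra 120.0 bound, the 90-second observation is no longer distinguishable only as "above 60"; it is counted in a finite bucket, which tightens quantile estimates computed from the histogram.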
References
#15202 - [Metrics] Add bucket for `request_latency`, `time_to_first_token` and `time_per_output_token`
Author
yankay
Parents
2549c0df