text-generation-inference
bb8f5963
- feat(metrics): exposes queue size as tokens along with individual requests count
Commit
327 days ago
feat(metrics): exposes queue size as tokens along with individual requests count
References
#3065 - Expose the real-time internal state of the batcher through SSE
Author
mfuntowicz
Parents
5eec3a8b