vllm
05a83dc6
- feat(api): Eager chat template warmup to eliminate first-request latency (#30700)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 days ago
feat(api): Eager chat template warmup to eliminate first-request latency (#30700) Signed-off-by: Nathan Price <nathan@abridge.com>
References
#30700 - feat(api): Eager chat template warmup to eliminate first-request latency
Author
TheCodeWrangler
Parents
e3fc374a
Loading