text-generation-inference
bd2ec03d
- backend(vllm): statically allocate LLMEngine
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
backend(vllm): statically allocate LLMEngine
Author
mfuntowicz
Parents
cfd22726
Loading