text-generation-inference
bd2ec03d - backend(vllm): statically allocate LLMEngine

Commit
1 year ago
backend(vllm): statically allocate LLMEngine
Author
Parents
Loading