vllm
8f4824b6 - [Model Runner V2] Gather multimodal embeddings before draft model postprocess (#37932)

Commit
15 days ago
[Model Runner V2] Gather multimodal embeddings before draft model postprocess (#37932) Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>
Parents
Loading