vllm
d48f4d6d
- perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled (#25739)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
217 days ago
perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled (#25739) Signed-off-by: Andrew Sansom <andrew@protopia.ai>
References
#25739 - perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled
Author
qthequartermasterman
Parents
e84e0735
Loading