vllm
c34ba6b9 - [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)

Commit
43 days ago
[Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710) Signed-off-by: yewentao256 <zhyanwentao@126.com>
Author
Parents
Loading