vllm
c34ba6b9
- [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)
Commit
43 days ago
[Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement (#36710)

Signed-off-by: yewentao256 <zhyanwentao@126.com>
References
#36710 - [Perf] Optimize compute maxsim using batched version, 3.2% E2E throughput improvement
Author
yewentao256
Parents
24062b70