vllm
e526b1c0
- fix num_tokens_across_dp sizing issue
Commit
252 days ago
fix num_tokens_across_dp sizing issue

Signed-off-by: Sage Moore <sage@neuralmagic.com>
References
#23693 - [Core/DBO][1/N] Add Dual-Batch Overlap mechanism to VLLM
Author
SageMoore
Committer
SageMoore
Parents
44ead56a