vllm
cb10b7e8
- [GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill (#38361)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
18 days ago
[GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill (#38361) Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com> Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
References
#38361 - [GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill
Author
arpera
Parents
bf8b022e
Loading