vllm
cb10b7e8 - [GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill (#38361)

Commit
18 days ago
[GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill (#38361)

Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>