CUDA performance optimization: asynchronous computation by using only one cudaStream #1898
slaren
approved these changes
on 2023-06-16
Only one CUDA stream per device for async compute
8a93a05a
ggerganov
approved these changes
on 2023-06-17
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub