llama.cpp
CUDA memory pool with async memory allocation/deallocation
#3903
Merged

CUDA memory pool with async memory allocation/deallocation #3903

young-developer
Using cuda memory pools for async alloc/dealloc.
08868a44
If cuda device doesnt support memory pool than use old implementation.
7e6f4132
young-developer young-developer changed the title CUDA memory pool with async memory allocation deallocation CUDA memory pool with async memory allocation/deallocation 1 year ago
slaren
slaren commented on 2023-11-02
Removed redundant cublasSetStream
587ff3bf
slaren
slaren
slaren approved these changes on 2023-11-02
young-developer
slaren
ggerganov
ggerganov approved these changes on 2023-11-02
young-developer
slaren
young-developer
ggerganov ggerganov merged d6069051 into master 1 year ago
cebtenzzre
young-developer
young-developer young-developer deleted the cuda-memory-pool branch 1 year ago
young-developer

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone