llama.cpp
Improve cuBLAS performance by using a memory pool
#1094
Merged

Improve cuBLAS performance by using a memory pool #1094

slaren merged 4 commits into ggml-org:master from slaren:cuda-pool
slaren
slaren Improve cuBLAS performance by using a memory pool
e8797a9a
slaren Move cuda specific definitions to ggml-cuda.h/cu
641e9a0c
slaren Add CXX flags to nvcc
c832e7c7
glinscott
dfyz
dfyz commented on 2023-04-21
slaren
SlyEcho
dfyz
ggerganov
ggerganov
ggerganov approved these changes on 2023-04-21
SlyEcho
slaren
slaren
SlyEcho
slaren
slaren
slaren Change memory pool synchronization mechanism to a spin lock
d774e054
slaren
slaren slaren merged 50cb666b into master 2 years ago
slaren slaren deleted the cuda-pool branch 2 years ago
Dampfinchen
slaren
dfyz
slaren
SlyEcho
slaren
dfyz
slaren
dfyz
slaren
SlyEcho
slaren
SlyEcho

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone