llama.cpp
50cb666b - Improve cuBLAS performance by using a memory pool (#1094)

Commit

3 years ago

Improve cuBLAS performance by using a memory pool (#1094)

* Improve cuBLAS performance by using a memory pool
* Move cuda specific definitions to ggml-cuda.h/cu
* Add CXX flags to nvcc
* Change memory pool synchronization mechanism to a spin lock

General code cleanup
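
For readers unfamiliar with the technique, a spin-lock-guarded buffer pool of the kind the commit message describes might look roughly like the sketch below. This is an illustration only, not the code from this commit: every name in it (`pool_malloc`, `pool_free`, `PoolBuffer`, `kMaxBuffers`, `SpinLockGuard`) is hypothetical, and `std::malloc`/`std::free` stand in for `cudaMalloc`/`cudaFree` so the example runs on a machine without a GPU. The idea is that freed buffers are parked in a small fixed-size table and handed back on the next request that fits, which avoids paying for a device allocation on every matrix multiplication.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical sketch: all names are illustrative, and std::malloc/std::free
// stand in for cudaMalloc/cudaFree so this compiles and runs without CUDA.
constexpr int kMaxBuffers = 16;

struct PoolBuffer {
    void*       ptr  = nullptr;  // nullptr marks an empty slot
    std::size_t size = 0;        // capacity of the cached buffer in bytes
};

static PoolBuffer       g_pool[kMaxBuffers];
static std::atomic_flag g_pool_lock = ATOMIC_FLAG_INIT;

// RAII spin lock: busy-waits until the flag is acquired, releases on scope exit.
struct SpinLockGuard {
    SpinLockGuard()  { while (g_pool_lock.test_and_set(std::memory_order_acquire)) { } }
    ~SpinLockGuard() { g_pool_lock.clear(std::memory_order_release); }
};

// Return a cached buffer of at least `size` bytes if one exists, otherwise
// allocate a fresh one. `actual_size` receives the real capacity, which the
// caller must pass back to pool_free.
void* pool_malloc(std::size_t size, std::size_t* actual_size) {
    assert(actual_size != nullptr);
    {
        SpinLockGuard guard;
        for (PoolBuffer& b : g_pool) {
            if (b.ptr != nullptr && b.size >= size) {
                void* ptr    = b.ptr;
                *actual_size = b.size;
                b.ptr        = nullptr;
                b.size       = 0;
                return ptr;
            }
        }
    }
    // No cached buffer fits: allocate outside the lock.
    *actual_size = size;
    return std::malloc(size);
}

// Park the buffer in the first empty slot; if the pool is full, release it.
void pool_free(void* ptr, std::size_t size) {
    {
        SpinLockGuard guard;
        for (PoolBuffer& b : g_pool) {
            if (b.ptr == nullptr) {
                b.ptr  = ptr;
                b.size = size;
                return;
            }
        }
    }
    std::free(ptr);
}
```

In this sketch, a caller that frees a 1024-byte buffer and then requests 512 bytes gets the same pointer back with a reported capacity of 1024. A spin lock is a plausible fit here because the critical section is only a short scan over a small array, so busy-waiting is likely cheaper than putting a thread to sleep; that reasoning is an assumption and is not stated in the commit message.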

References

#1094 - Improve cuBLAS performance by using a memory pool

Author

slaren

Parents

25d7abbd
