llama.cpp
a818f302 - CUDA: use MMQ instead of cuBLAS by default (#8075)

Commit
350 days ago
CUDA: use MMQ instead of cuBLAS by default (#8075)
Parents
  • File
    CMakeLists.txt
  • File
    Makefile
  • File
    README.md
  • File
    ggml-cuda.cu
  • ggml-cuda
    • File
      common.cuh
    • File
      mmq.cu
    • File
      mmq.cuh
    • File
      mmvq.cuh