llama.cpp
cb5fad4c - CUDA: refactor and optimize IQ MMVQ (#8215)

Commit
1 year ago
CUDA: refactor and optimize IQ MMVQ (#8215)

* CUDA: refactor and optimize IQ MMVQ
* uint -> uint32_t
* __dp4a -> ggml_cuda_dp4a
* remove MIN_CC_DP4A checks
* change default
* try CI fix
  • ggml/src
    • ggml-common.h
    • ggml-cuda.cu
    • ggml-cuda
      • common.cuh
      • fattn-common.cuh
      • mmvq.cu
      • vecdotq.cuh
    • ggml-sycl
      • mmvq.cpp
      • vecdotq.hpp