llama.cpp
a15ef8f8 - CUDA: fix partial offloading for ne0 % 256 != 0 (#8572)

Commit
328 days ago
  • ggml
    • include
      • ggml-backend.h
    • src
      • ggml-alloc.c
      • ggml-backend.c
      • ggml-cuda.cu