llama.cpp
91544948 - CUDA: mul_mat_id always on GPU for batches >= 32 (#4553)

Commit
1 year ago
CUDA: mul_mat_id always on GPU for batches >= 32 (#4553)
Parents
  • File
    ggml-cuda.cu