llama.cpp
908a9e5a - CUDA: disable cuda graph when using n-cpu-moe (#18593)

Commit
7 days ago
CUDA: disable cuda graph when using n-cpu-moe (#18593) * CUDA: disable cuda graph when using n-cpu-moe * call ggml_cuda_set_device
Author
Parents
Loading