llama.cpp
908a9e5a
- CUDA: disable cuda graph when using n-cpu-moe (#18593)
Commit
7 days ago
CUDA: disable cuda graph when using n-cpu-moe (#18593)

* CUDA: disable cuda graph when using n-cpu-moe
* call ggml_cuda_set_device
References
#18593 - CUDA: disable cuda graph when using n-cpu-moe
Author
am17an
Parents
5126c41c