llama.cpp
0a423800
- CUDA: revert part of the RDNA1 optimizations (#8309)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
342 days ago
CUDA: revert part of the RDNA1 optimizations (#8309) The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s
References
#8309 - CUDA: revert part of the RDNA1 optimizations
Author
daniandtheweb
Parents
d12f7810
Files
1
ggml/src/ggml-cuda
mmq.cuh
Loading