llama.cpp
0a423800
- CUDA: revert part of the RDNA1 optimizations (#8309)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
CUDA: revert part of the RDNA1 optimizations (#8309) The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s
References
#8309 - CUDA: revert part of the RDNA1 optimizations
Author
daniandtheweb
Parents
d12f7810
Loading