llama.cpp
0a423800 - CUDA: revert part of the RDNA1 optimizations (#8309)

Commit

1 year ago

CUDA: revert part of the RDNA1 optimizations (#8309) The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s

References

Author

daniandtheweb

Parents