llama.cpp
061f5f8d
- CUDA: add __restrict__ to mul mat vec kernels (#2140)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
CUDA: add __restrict__ to mul mat vec kernels (#2140)
References
#2140 - 5.5x more CUDA performance with 5 minutes of work
Author
JohannesGaessler
Parents
84525e79
Loading