llama.cpp
f64d44a9
- CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)
References
#2590 - CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time
Author
JohannesGaessler
Parents
b19edd54
Loading