llama.cpp
f64d44a9 - CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)

Commit
2 years ago