llama.cpp
f64d44a9 - CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)

Commit

2 years ago

CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)

References

#2590 - CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time

Author

JohannesGaessler

JohannesGaessler

Parents

Loading