llama.cpp
Add ReLU and SQR CUDA ops to fix Persimmon offloading
#4041
Merged
