llama.cpp
bb50a792 - Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)

Commit

2 years ago

Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041) * Add ReLU and SQR CUDA ops to fix Persimmon offloading * Persimmon loader: More helpful error on CUDA/ROCM when offloading too many layers

References

#4041 - Add ReLU and SQR CUDA ops to fix Persimmon offloading

Author

KerfuffleV2

Parents

21fd874c

llama.cpp bb50a792 - Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)

llama.cpp
bb50a792 - Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)