llama.cpp
bb50a792 - Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)

Commit
1 year ago
Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041) * Add ReLU and SQR CUDA ops to fix Persimmon offloading * Persimmon loader: More helpful error on CUDA/ROCM when offloading too many layers
Author
Parents
Loading