llama.cpp
bb50a792
- Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041) * Add ReLU and SQR CUDA ops to fix Persimmon offloading * Persimmon loader: More helpful error on CUDA/ROCM when offloading too many layers
References
#4041 - Add ReLU and SQR CUDA ops to fix Persimmon offloading
Author
KerfuffleV2
Parents
21fd874c
Loading