llama.cpp
cb82adad
- metal : first working version of the inference without prompt processing
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
metal : first working version of the inference without prompt processing Bonus: supports partial inference on the CPU
Author
ggerganov
Parents
290cb700
Loading