llama.cpp
cb82adad - metal : first working version of the inference without prompt processing

Commit
2 years ago
metal : first working version of the inference without prompt processing Bonus: supports partial inference on the CPU
Author
Parents
Loading