llama.cpp
8c5b66ee
- metal : reduce the kernel launches for ggml_mul_mat_id
Commit
2 years ago
References
#4406 - llama : add Mixtral support
Author
ggerganov
Parents
7e2006b0