llama.cpp
8c5b66ee - metal : reduce the kernel launches for ggml_mul_mat_id

Commit

2 years ago

metal : reduce the kernel launches for ggml_mul_mat_id

References

#4406 - llama : add Mixtral support

Author

ggerganov

