llama.cpp
fbddb262
- ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
341 days ago
ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1
References
compilade/cuda-tq2_0
#11183 - ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU
Author
compilade
Committer
compilade
Parents
b6fc9f03
Loading