llama.cpp
fbddb262 - ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1

Commit

1 year ago

ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1

References

compilade/cuda-tq2_0

#11183 - ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU

Author

compilade

compilade

Committer

compilade

compilade

Parents

Loading