llama.cpp
93034df7 - iq3_s_mult_shuffle: use lookup table on CUDA

Commit
1 year ago
iq3_s_mult_shuffle: use lookup table on CUDA ~4% faster TG that way.
Author
Iwan Kawrakow
Parents
Loading