llama.cpp
3fe81781 - CUDA: faster q8_0 -> f16 dequantization (#4895)

Commit
1 year ago
CUDA: faster q8_0 -> f16 dequantization (#4895)
Parents
Loading