llama.cpp
bdcb8f42 - CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K) (#7860)

Commit
1 year ago
CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K) (#7860)
Parents
Loading