whisper.cpp
b17ba281 - CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (llama/7921)

Commit
1 year ago
CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (llama/7921)

* CUDA: faster q2_K, q3_K MMQ + int8 tensor cores
* try CI fix
* try CI fix
* try CI fix
* fix data race
* revert q2_K precision related changes