llama.cpp
3b099bcd - CUDA: fix MMQ writeback for int8 tensor cores (#8100)

Commit
1 year ago
CUDA: fix MMQ writeback for int8 tensor cores (#8100)
Parents
Loading