llama.cpp
bdcb8f42
- CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K) (#7860)
Commit
1 year ago
References
#7860 - CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K)
Author
JohannesGaessler
Parents
c2ce6c47