llama.cpp
69664749
- cuda : play with faster Q4_0 dequantization
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
cuda : play with faster Q4_0 dequantization
References
cuda-batched-gemm-deq
Author
ggerganov
Parents
d4156690
Loading