llama.cpp
f0877604
- add q8_0 q4_0 tests
Commit
1 year ago
add q8_0 q4_0 tests
References
#7527 - CUDA: quantized KV support for FA vec
Author
JohannesGaessler
Committer
JohannesGaessler
Parents
3194a010