llama.cpp
bcfebf24
- metal : add F32 -> Q8_0 copy kernel
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
metal : add F32 -> Q8_0 copy kernel
References
#4312 - llama : support quantum K cache
Author
ggerganov
Parents
d04ee928
Loading