llama.cpp
465263d0
- sgemm : AVX Q4_0 and Q8_0 (#6891)
Commit
1 year ago
sgemm : AVX Q4_0 and Q8_0 (#6891)

* basic avx implementation
* style
* combine denibble with load
* reduce 256 to 128 (and back!) conversions
* sse load
* Update sgemm.cpp
* oops oops
References
#6891 - AVX Q4_0 and Q8_0 sgemm
Author
netrunnereve
Parents
911b3900