llama.cpp
0cb7a068 - opencl: add q8_0 mm support (#16469)

Commit
20 days ago
opencl: add q8_0 mm support (#16469)

* opencl: add mm_q8_0_f32
* opencl: fix data loading for incomplete tile
* opencl: use q8_0 mm for larger matrix
* opencl: add some tests to cover the path