llama.cpp
663445b0 - sycl: quantize and reorder the input to q8_1 when reorder is enabled (#13826)

Commit
101 days ago
sycl: quantize and reorder the input to q8_1 when reorder is enabled (#13826) * [WIP]: fuse q8 quantization and reorder * wip2: fuse q8 quantization and reorder * working q8 reorder commit * restored common.hpp * remove debug prints * remove unnecessary headers and remove trailing whitespace * Update ggml/src/ggml-sycl/ggml-sycl.cpp Co-authored-by: Alberto Cabrera Pérez <alberto.cabrera@intel.com> --------- Co-authored-by: Alberto Cabrera Pérez <alberto.cabrera@intel.com>
Author
Parents
Loading