llama.cpp
08d59862 - [SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)

Commit

1 year ago

[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035) * opt performance by reorder for Intel GPU * detect hw type and save opt feature, and print opt feature * correct name * support optimize graph once when compute graph, record the opt status in tensor->extra, make CI passed * add env variable GGML_SYCL_DISABLE_OPT for debug * use syclex::architecture replace the custom hw define, update the guide for GGML_SYCL_DISABLE_OPT * add performance data * mv getrows functions to separeted files * fix global variables --------- Co-authored-by: arthw <14088817+arthw@users.noreply.github.com>

References

#12035 - [SYCL] Optimize mul_mat for Q4_0 on Intel GPU

Author

NeoZhangJianyu

Parents

651adf4b

llama.cpp 08d59862 - [SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)

llama.cpp
08d59862 - [SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)