llama.cpp
sycl-exp : dequant q4 k improvements
#7972
Merged

Commits
  • Remove double lines
    Aidan committed 1 year ago
  • Single load for half2
    Aidan committed 1 year ago
  • Store scales in local mem
    Aidan committed 1 year ago
  • Vectorize q load
    Aidan committed 1 year ago