vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations #11595
vulkan: implement specialized MMV kernels for IQ2 quantizations
b80033ef
vulkan: add MMV kernels for IQ3 quants
e3228c74
vulkan: Increase MMV batch size and unroll IQ LUT setup
c263f8f3
vulkan: fix init_iq_shmem for WG sizes larger than tables
8608322f
remyoudompheng marked this pull request as ready for review 1 year ago
vulkan: common batch size for all I-quants
cfea4ddb
0cc4m approved these changes on 2025-02-28
0cc4m merged 438a8392 into master 1 year ago
Assignees
No one assigned
Labels
Vulkan
devops
ggml