llama.cpp
vulkan: initial support for IQ1_S and IQ1_M quantizations
#11528
Merged

vulkan: initial support for IQ1_S and IQ1_M quantizations #11528

0cc4m merged 4 commits into ggml-org:master from remyoudompheng:vulkan-iq1
remyoudompheng
github-actions github-actions added Vulkan
github-actions github-actions added ggml
jeffbolznv
jeffbolznv commented on 2025-01-30
github-actions github-actions added devops
remyoudompheng
remyoudompheng
jeffbolznv
remyoudompheng remyoudompheng force pushed 1 year ago
remyoudompheng remyoudompheng force pushed 1 year ago
remyoudompheng remyoudompheng marked this pull request as ready for review 1 year ago
remyoudompheng
0cc4m
0cc4m commented on 2025-02-10
remyoudompheng vulkan: initial support for IQ1_S and IQ1_M quantizations
73463b4b
remyoudompheng vulkan: define MMV kernels for IQ1 quantizations
14e65cb0
remyoudompheng devops: increase timeout of Vulkan tests again
b9af7008
remyoudompheng vulkan: simplify ifdef for init_iq_shmem
431b61d2
remyoudompheng remyoudompheng force pushed to 431b61d2 1 year ago
remyoudompheng
netrunnereve
0cc4m
0cc4m approved these changes on 2025-02-15
0cc4m 0cc4m merged fc1b0d09 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone