llama.cpp
vulkan: im2col and matmul optimizations for stable diffusion
#10942
Merged

vulkan: im2col and matmul optimizations for stable diffusion #10942

0cc4m merged 4 commits into ggml-org:master from jeffbolznv:im2col
jeffbolznv
jeffbolznv tests: Add im2col perf tests
20744981
jeffbolznv vulkan: optimize im2col, more elements per thread
26252831
jeffbolznv vulkan: increase small tile size for NV_coopmat2
e52a0f28
jeffbolznv jeffbolznv requested a review from 0cc4m 0cc4m 293 days ago
github-actions github-actions added testing
github-actions github-actions added Vulkan
github-actions github-actions added ggml
daniandtheweb
0cc4m
jeffbolznv vulkan: change im2col to 512 elements per workgroup
70676c35
jeffbolznv
0cc4m
0cc4m
0cc4m approved these changes on 2024-12-29
0cc4m 0cc4m merged a813badb into master 285 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone