vulkan: im2col and matmul optimizations for stable diffusion #10942
tests: Add im2col perf tests
20744981
vulkan: optimize im2col, more elements per thread
26252831
vulkan: increase small tile size for NV_coopmat2
e52a0f28
vulkan: change im2col to 512 elements per workgroup
70676c35
0cc4m
approved these changes
on 2024-12-29
0cc4m
merged
a813badb
into master 285 days ago
Assignees
No one assigned
Labels
testing
Vulkan
ggml
Login to write a write a comment.
Login via GitHub