vulkan: optimizations for deepseek prompt processing #14555
vulkan: allow unclamped loads in coopmat2 mul_mat_id shader
129a0f17
vulkan: increase coopmat2 mul_mat_id tile size
b54ddba8
vulkan: optimize mat_mul_id row_ids search to batch loads, and port t…
2b540868
vulkan: use smaller FA row size when head size is large. applies to b…
bd8e0bfb
0cc4m
approved these changes
on 2025-07-12
0cc4m
merged
98197e5c
into master 91 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub