llama.cpp
vulkan: handle quantize_q8_1 overflowing the max workgroup count
#18515
Merged

vulkan: handle quantize_q8_1 overflowing the max workgroup count #18515

0cc4m merged 3 commits into ggml-org:master from jeffbolznv:issue_18510
jeffbolznv
jeffbolznv vulkan: handle quantize_q8_1 overflowing the max workgroup count
9b1b9618
jeffbolznv jeffbolznv requested a review from ggerganov ggerganov 85 days ago
jeffbolznv jeffbolznv requested a review from 0cc4m 0cc4m 85 days ago
jeffbolznv
jeffbolznv commented on 2025-12-31
github-actions github-actions added testing
github-actions github-actions added Vulkan
github-actions github-actions added ggml
jeffbolznv vulkan: Fix small tile size matmul on lavapipe
cdf2427a
jeffbolznv
jeffbolznv commented on 2026-01-01
characharm
jeffbolznv fix mul_mat_id failures
cd53729e
0cc4m
0cc4m approved these changes on 2026-01-02
0cc4m 0cc4m merged b37124d2 into master 80 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone