llama.cpp
ggml-webgpu: Improve the mat-vec and mat-mat of MUL_MAT_ID
#22464
Merged


yomaytk requested a review 50 days ago
yomaytk commented on 2026-04-28
CISC approved these changes on 2026-04-28
github-actions added ggml
github-actions added WebGPU
yomaytk force pushed from 3e8ec276 to 7c5cebea 50 days ago
yomaytk commented on 2026-04-28
yomaytk force pushed from f2a55032 to 5466aa82 49 days ago
yomaytk commented on 2026-04-29
yomaytk Add mat-vec fast path of MUL_MAT_ID.
590fa483
yomaytk Add shared accumulation vec logic and the other types supports.
afe0c628
yomaytk force pushed from 5466aa82 to 93995420 48 days ago
yomaytk Add i-quant mat-mat for MUL_MAT_ID and fix some parts
c0700cdf
yomaytk force pushed from 66e74003 to c0700cdf 48 days ago
yomaytk changed the title from "ggml-webgpu: Improve the mat-vec performance of MUL_MAT_ID" to "ggml-webgpu: Improve the mat-vec and mat-mat of MUL_MAT_ID" 48 days ago
reeselevine commented on 2026-04-30
yomaytk Remove n_experts from shader_lib_context.
a44c0fb9
reeselevine approved these changes on 2026-04-30
reeselevine merged a95a11e5 into master 48 days ago
yomaytk deleted the mul_mat_id_vec branch 47 days ago

