llama.cpp
ggml-webgpu: updated matrix-vector multiplication
#21738
Merged

ggml-webgpu: updated matrix-vector multiplication #21738

neha-ha
merged properly, but slow q3_k and q5_k with u32 indexing
3c36b556
neha-ha neha-ha requested a review from ggerganov ggerganov 72 days ago
neha-ha neha-ha requested a review 72 days ago
github-actions github-actions added ggml
github-actions github-actions added WebGPU
yomaytk
reeselevine
yomaytk
reeselevine Start on new mat-vec
3c9e474c
reeselevine New format float paths working
0bcf75c1
reeselevine Working q4_0
01bd9127
reeselevine Work on remaining legacy q-types
f839c103
reeselevine port k-quants to new matvec
ba961225
reeselevine remove old shader
b4b6ffc4
reeselevine Merge remote-tracking branch 'upstream/master' into k_quant_speedup
83a0d381
reeselevine reeselevine force pushed from 41259410 to 83a0d381 65 days ago
reeselevine
reeselevine Remove old constants, format
ca49e73a
reeselevine
reeselevine approved these changes on 2026-04-17
reeselevine reeselevine requested a review from CISC CISC 65 days ago
reeselevine reeselevine added merge ready
CISC
CISC approved these changes on 2026-04-17
reeselevine remove accidental file
b92011ef
reeselevine
reeselevine approved these changes on 2026-04-19
ggerganov
ggerganov approved these changes on 2026-04-20
reeselevine reeselevine merged a6cc43c2 into master 62 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone