llama.cpp
ggml webgpu: faster normal quant and some k-quant matrix operations, better shader parameter handling
#20173
Merged

reeselevine merged 6 commits into ggml-org:master from reeselevine:master
reeselevine K quant speedup (#20)
52058f3b
reeselevine Move towards writeBuffer for params
3a0d3e1b
reeselevine Move away from multiple buffers for set_rows errors, remove host buff…
efab3dfb
reeselevine Merge remote-tracking branch 'upstream/master'
d77731c2
reeselevine Remove extra file
02cac094
github-actions added the ggml label
CISC approved these changes on 2026-03-06
reeselevine Formatting
1dbdc5b8
nikhilJain17 approved these changes on 2026-03-09
reeselevine merged aa2d278a into master 12 days ago
