llama.cpp
ggml WebGPU: add support for quantization types
#15440
Merged

reeselevine merged 25 commits into ggml-org:master from reeselevine:master
reeselevine Begin work on set_rows
6a6135cc
reeselevine Work on set rows
b2dbfcdc
reeselevine Add error buffers for reporting unsupported SET_ROWS indices
248f7a51
reeselevine Remove extra comments
4ad09861
reeselevine Work on templating for different types in shaders
6355137c
reeselevine Work on shader type generation
831ea3c3
reeselevine Working q4_0 mul_mat and some templating for different types
688b51db
reeselevine Add q4_0_f16 matmul and fix device init
1aa40f1a
reeselevine Add matmul support for basic quantization types
c3611f9d
reeselevine Add q2_k and q3_k quantization
de4da871
reeselevine Add rest of k-quants
d76e562b
reeselevine Get first i-quant working
e2380e25
reeselevine Closer to supporting all i-quants
2a3b9ee2
reeselevine Support rest of i-quants
57c26b17
reeselevine Merge remote-tracking branch 'origin/master' into types
7a2ae489
reeselevine Cleanup code
51252f02
reeselevine Merge pull request #2 from reeselevine/types
985508e5
github-actions added python
github-actions added ggml
reeselevine Fix python formatting
6552e2e4
reeselevine debug
65bebd31
reeselevine Bugfix for memset
16df269a
reeselevine Add padding to end of buffers on creation
10babfd7
reeselevine Simplify bit-shifting
7a323b08
reeselevine Merge pull request #3 from reeselevine/fixes
d1b0ffe9
reeselevine Update usage of StringView
d6903035
ggerganov approved these changes on 2025-08-22
reeselevine Merge remote-tracking branch 'upstream/master'
1fcc4047
reeselevine merged 45363632 into master 140 days ago
