llama.cpp
ggml-webgpu: address precision issues for multimodal
#22808
Merged

ggml-webgpu: address precision issues for multimodal #22808

Constannnnnt
Constannnnnt fix(mixed-types): use f32 for precision and update the shared memory …
210766ce
Constannnnnt fix(unary): correct the gelu, gelu quick and gelu erf functions
2380a062
Constannnnnt fix(flash-attn-tile): fix the hardcode v type
f5f940fc
Constannnnnt fix(flash_attn): fix tile path
f7b1560d
Constannnnnt Merge branch 'master' of https://github.com/ggml-org/llama.cpp into w…
ef301b66
Constannnnnt Constannnnnt requested a review 40 days ago
github-actions github-actions added ggml
github-actions github-actions added WebGPU
reeselevine
reeselevine commented on 2026-05-08
reeselevine reeselevine changed the title ggml-webgpu: address precision issues for multimodel ggml-webgpu: address precision issues for multimodal 39 days ago
Constannnnnt
reeselevine
Constannnnnt Merge branch 'master' of https://github.com/ggml-org/llama.cpp into w…
2fb28310
Constannnnnt fix: pass editorconfig and address the type conflicts
d2bd5ebd
Constannnnnt fix: remove reduant pipeline keys
ab59cd5a
Constannnnnt fix: remove inline min/max group size functions and revert the flash …
5a6f9c63
Constannnnnt Merge branch 'master' of https://github.com/ggml-org/llama.cpp into w…
ffa63523
Constannnnnt
Constannnnnt Merge branch 'master' of https://github.com/ggml-org/llama.cpp into w…
8c5597f2
Constannnnnt fix: use clamp to avoid NaN for GELU
e115d544
Constannnnnt fix: use the right range for exp, 80 is safer for f32 exp
6797d633
reeselevine
reeselevine approved these changes on 2026-05-12
reeselevine reeselevine requested a review from CISC CISC 36 days ago
reeselevine reeselevine requested a review from ggerganov ggerganov 36 days ago
CISC
CISC approved these changes on 2026-05-12
reeselevine reeselevine merged 239a497e into master 35 days ago
Constannnnnt Constannnnnt deleted the webgpu/fix-mul-max branch 35 days ago
ArberSephirotheca
Constannnnnt
Constannnnnt
reeselevine
Constannnnnt

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone