llama.cpp
ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support
#21046
Open

ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support #21046

reeselevine wants to merge 5 commits into ggml-org:master from reeselevine:remove_bitcast
reeselevine
reeselevine Work towards removing bitcast
f1eb80ef
reeselevine Move rest of existing types over
e9af481e
reeselevine Add timeout back to wait and remove synchronous set_tensor/memset_tensor
b3aa3be8
reeselevine move to unpackf16 for wider compatibility
67fe0897
reeselevine cleanup
e85e8bcc
reeselevine reeselevine requested a review 3 days ago
reeselevine reeselevine changed the title Move quantized buffers to u32 types and some other changes for wider browser/device support ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support 3 days ago
github-actions github-actions added ggml
github-actions github-actions added WebGPU

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone