llama.cpp
ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support
#21046

Open

reeselevine wants to merge 5 commits into ggml-org:master from reeselevine:remove_bitcast

Work towards removing bitcast

f1eb80ef

Move rest of existing types over

e9af481e

Add timeout back to wait and remove synchronous set_tensor/memset_tensor

b3aa3be8

move to unpackf16 for wider compatibility

67fe0897

cleanup

e85e8bcc

reeselevine requested a review 3 days ago

reeselevine changed the title ~~Move quantized buffers to u32 types and some other changes for wider browser/device support~~ ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support 3 days ago

github-actions added ggml

github-actions added WebGPU

Reviewers

No reviews

Assignees

No one assigned

Labels

ggml WebGPU

Milestone

No milestone
