llama.cpp
82764c34 - ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)

Commit

13 days ago

ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046) * Work towards removing bitcast * Move rest of existing types over * Add timeout back to wait and remove synchronous set_tensor/memset_tensor * move to unpackf16 for wider compatibility * cleanup * Remove deadlock condition in free_bufs

References

#21046 - ggml webgpu: move quantized buffers to u32 types and some other changes for wider browser/device support

Author

reeselevine

Parents

825eb91a

llama.cpp 82764c34 - ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)

llama.cpp
82764c34 - ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)