llama.cpp
647b960b - ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)

Commit
64 days ago
ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) * Faster tensors (#8) Add fast matrix and matrix/vector multiplication. * Use map for shader replacements instead of pair of strings
Author
Parents
Loading