llama.cpp
647b960b - ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)

Commit

153 days ago

ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) * Faster tensors (#8) Add fast matrix and matrix/vector multiplication. * Use map for shader replacements instead of pair of strings

References

#17031 - ggml webgpu: faster matrix multiplication/matrix-vector multiplication

Author

reeselevine

Parents

299f5d78

llama.cpp 647b960b - ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)

llama.cpp
647b960b - ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)