llama.cpp
03ea0417 - ggml webgpu: minor set rows optimization (#16810)

Commit

133 days ago

ggml webgpu: minor set rows optimization (#16810) * Add buffer label and enable dawn-specific toggles to turn off some checks * Minor set_rows optimization (#4) * updated optimization, fixed errors * non vectorized version now dispatches one thread per element * Simplify * Change logic for set_rows pipelines --------- Co-authored-by: Neha Abbas <nehaabbas@macbookpro.lan> Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local> Co-authored-by: Reese Levine <reeselevine1@gmail.com> * Comment on dawn toggles * Remove some comments * Implement overlap binary operators * Revert "Implement overlap binary operators" This reverts commit ed710b36f51ab3f53fa13db15c1685dc8678a32a. * Disable support for non-contiguous binary_op tensors and leave note for future support --------- Co-authored-by: neha-ha <137219201+neha-ha@users.noreply.github.com> Co-authored-by: Neha Abbas <nehaabbas@macbookpro.lan> Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local>

References

#16810 - ggml webgpu: minor set rows optimization

Author

reeselevine

Parents

cdabeb2c

llama.cpp 03ea0417 - ggml webgpu: minor set rows optimization (#16810)

llama.cpp
03ea0417 - ggml webgpu: minor set rows optimization (#16810)