PR #16559 metal: optimise `GGML_OP_SUM`

metal: optimise `GGML_OP_SUM` #16559

ggerganov merged 4 commits into ggml-org:master from cern1710:metal-optimize-op-sum

optimise GGML_OP_SUM

9cc51d39

cern1710 requested a review from

ggerganov 232 days ago

github-actions added ggml

github-actions added Apple Metal

add non-contiguous tests by permuting the input

4619142b

cern1710 requested a review from

slaren 232 days ago

github-actions added testing

change tests to require full contiguity of OP_SUM

c25a6c7c

ggerganov approved these changes on 2025-10-14

cuda : add check GGML_OP_SUM

be1b6703

ggerganov merged f4ce81c4 into master 230 days ago

github-actions added Nvidia GPU

Reviewers

ggerganov

slaren

Assignees

No one assigned

Labels

testing Nvidia GPU ggml Apple Metal

Milestone

No milestone