llama.cpp
55717c98 - metal : warp-based reduction for soft max kernel

Commit
1 year ago
Files
  • ggml-metal.m
  • ggml-metal.metal