llama.cpp 55717c98 - metal : warp-based reduction for soft max kernel
Commit
1 year ago
metal : warp-based reduction for soft max kernel
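The diff itself is not shown on this page, so the following is not the code from ggml-metal.metal. It is a minimal CPU sketch in C++ of what a warp-based (SIMD-group) reduction for a soft max row generally looks like. It assumes a lane width of 32 and emulates the lane-to-lane exchange with a butterfly loop. The names `kLanes`, `simd_all_reduce`, and `soft_max_row` are hypothetical and chosen only for illustration.

```cpp
#include <algorithm>
#include <array>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Assumed SIMD-group (warp) width; hypothetical constant for this sketch.
constexpr std::size_t kLanes = 32;

// Emulates a butterfly (xor-shuffle) all-reduce across the lanes of one
// SIMD group. After log2(kLanes) steps every lane holds the reduced value.
template <typename Op>
float simd_all_reduce(std::array<float, kLanes> v, Op op) {
    for (std::size_t offset = kLanes / 2; offset > 0; offset /= 2) {
        std::array<float, kLanes> next{};
        for (std::size_t lane = 0; lane < kLanes; ++lane) {
            // Each lane combines its value with its partner lane's value.
            next[lane] = op(v[lane], v[lane ^ offset]);
        }
        v = next;
    }
    return v[0];
}

// Soft max over one row: per-lane partial results, then a lane-level
// reduction for the maximum and another for the sum of exponentials.
std::vector<float> soft_max_row(const std::vector<float>& x) {
    std::array<float, kLanes> partial;

    // Pass 1: each lane scans a strided slice of the row for its maximum.
    partial.fill(-std::numeric_limits<float>::infinity());
    for (std::size_t i = 0; i < x.size(); ++i) {
        partial[i % kLanes] = std::max(partial[i % kLanes], x[i]);
    }
    const float row_max = simd_all_reduce(
        partial, [](float a, float b) { return std::max(a, b); });

    // Pass 2: exponentiate relative to the maximum and accumulate per lane.
    std::vector<float> y(x.size());
    partial.fill(0.0f);
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = std::exp(x[i] - row_max);
        partial[i % kLanes] += y[i];
    }
    const float sum = simd_all_reduce(
        partial, [](float a, float b) { return a + b; });

    // Pass 3: normalize so the row sums to 1.
    for (float& v : y) {
        v /= sum;
    }
    return y;
}
```

Calling `soft_max_row({1.0f, 2.0f, 3.0f})` returns three probabilities that sum to 1. On a GPU the inner loop of `simd_all_reduce` would be replaced by hardware lane-exchange instructions, which is usually the motivation for this kind of change: the lanes of one group can combine values without a round trip through shared memory for every step.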
References
#4256 - ggml : add ggml_soft_max_ext
Author
ggerganov
Committer
ggerganov
Parents
68e02c0d
Files
2
ggml-metal.m
ggml-metal.metal