llama.cpp
88519fbf - cuda : implement soft_max_ext

Commit
1 year ago
cuda : implement soft_max_ext
Author
Committer
Parents
  • File
    ggml-cuda.cu
  • File
    ggml.c
  • File
    llama.cpp