llama.cpp
a1c004ef - ggml : add ggml_flash_attn_ext API

Commit
1 year ago
Changed files:
  • ggml-metal.m
  • ggml-metal.metal
  • ggml.c
  • ggml.h
  • llama.cpp
  • tests
    • test-backend-ops.cpp