llama.cpp
CUDA: add FP32 FlashAttention vector kernel
#7188
Merged

Commits
  • CUDA: add FP32 FlashAttention vector kernel
    JohannesGaessler committed 2 years ago
  • fixup! CUDA: add FP32 FlashAttention vector kernel
    JohannesGaessler committed 2 years ago
  • fixup! fixup! CUDA: add FP32 FlashAttention vector kernel
    JohannesGaessler committed 2 years ago
  • fixup! fixup! fixup! CUDA: add FP32 FlashAttention vector kernel
    JohannesGaessler committed 2 years ago
Loading