llama.cpp
CUDA: fix FlashAttention on Turing #13415
Merged

JohannesGaessler added commit 6fe0f09c: CUDA: fix FlashAttention on Turing
github-actions added the Nvidia GPU and ggml labels
slaren approved these changes on 2025-05-09
JohannesGaessler merged d8919424 into master
