llama.cpp
CUDA: fix Volta FlashAttention logic #11615 (Merged)

Author: JohannesGaessler
github-actions added labels: Nvidia GPU, ggml
JohannesGaessler requested a review from ggerganov 276 days ago
Commit 5ee63ee4: CUDA: fix Volta FlashAttention logic
JohannesGaessler force-pushed from ff0d3f67 to 5ee63ee4 276 days ago
ggerganov approved these changes on 2025-02-03
ggerganov merged 21c84b5d into master 276 days ago
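
The PR carries no written description, but the title points at a compute-capability dispatch fix. As a rough illustration only (not the actual diff), the sketch below shows the kind of gate such a fix involves: Volta (compute capability 7.0) predates the PTX mma instructions used by the newer FlashAttention kernels, which require Turing (7.5) or later, so dispatch on Volta has to fall back to a WMMA-based path. All names below (`select_flash_attn_kernel`, the enum, the `CC_*` constants) are hypothetical, not llama.cpp's real API.

```cpp
// Hypothetical sketch of a compute-capability gate for FlashAttention
// kernel dispatch; identifiers are illustrative, not llama.cpp's API.
#include <cstdio>

enum class FlashAttnKernel { MMA_F16, WMMA_F16, VEC_F32 };

// Volta is compute capability 7.0; the PTX mma instructions used by the
// newer kernels need Turing (7.5) or later, so Volta must take the
// WMMA-based tensor-core path instead.
constexpr int CC_VOLTA  = 700;
constexpr int CC_TURING = 750;

FlashAttnKernel select_flash_attn_kernel(int cc) {
    if (cc >= CC_TURING) {
        return FlashAttnKernel::MMA_F16;  // mma instructions available
    }
    if (cc >= CC_VOLTA) {
        return FlashAttnKernel::WMMA_F16; // tensor cores via WMMA only
    }
    return FlashAttnKernel::VEC_F32;      // no tensor cores at all
}

int main() {
    // V100 (cc 7.0) should land on the WMMA path, T4 (cc 7.5) on mma.
    printf("cc 700 -> kernel %d\n", static_cast<int>(select_flash_attn_kernel(700)));
    printf("cc 750 -> kernel %d\n", static_cast<int>(select_flash_attn_kernel(750)));
    return 0;
}
```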
