llama.cpp
21c84b5d - CUDA: fix Volta FlashAttention logic (#11615)

Commit · 145 days ago
Files changed
  • ggml/src/ggml-cuda/fattn-wmma-f16.cu
  • ggml/src/ggml-cuda/fattn.cu
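
The changed files suggest the fix touches both the Volta-specific WMMA kernel (fattn-wmma-f16.cu) and the kernel-selection logic (fattn.cu). As a hedged illustration of the general technique (not the commit's actual code), the sketch below picks a FlashAttention path by CUDA compute capability: Volta (7.0) has tensor cores only through the WMMA API, while the mma PTX instructions used by newer kernels require Turing (7.5) or later. The constants and the pick_fattn_kernel helper are hypothetical stand-ins, not identifiers from fattn.cu.

```cpp
// Minimal sketch, under stated assumptions: route a FlashAttention call
// to a kernel family based on the device's compute capability.
// CC_VOLTA, CC_TURING, and pick_fattn_kernel are illustrative names.
#include <cuda_runtime.h>
#include <cstdio>

constexpr int CC_VOLTA  = 700; // Volta: tensor cores via the WMMA API only
constexpr int CC_TURING = 750; // Turing+: mma PTX instructions available

static const char * pick_fattn_kernel(int cc) {
    if (cc < CC_VOLTA) {
        return "vector kernel (no tensor cores)";
    }
    if (cc < CC_TURING) {
        // Volta must take the WMMA f16 path (cf. fattn-wmma-f16.cu);
        // it cannot run kernels built on Turing-era mma instructions.
        return "WMMA f16 kernel";
    }
    return "mma-based kernel";
}

int main() {
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) {
        fprintf(stderr, "no CUDA device found\n");
        return 1;
    }
    const int cc = 100 * prop.major + 10 * prop.minor;
    printf("compute capability %d -> %s\n", cc, pick_fattn_kernel(cc));
    return 0;
}
```

On a V100 (compute capability 7.0) this sketch selects the WMMA path; the commit itself adjusted this kind of routing so that Volta devices land on the correct kernel.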