whisper.cpp
edd1d868 - HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)

Commit
353 days ago
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check Adds rocWMMA support to fattn-wmma-f16
Author
Committer
Parents
Loading