llama.cpp
becade5d - HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)

Commit
1 year ago
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032) Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check Adds rocWMMA support to fattn-wmma-f16 --- Signed-off-by: Carl Klemm <carl@uvos.xyz> Co-authored-by: Johannes Gäßler <johannesg@5d6.de> Co-authored-by: Ben Jackson <ben@ben.com>
Author
Parents
Loading