llama.cpp
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
#12032
Merged

Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention #12032

IMbackK merged 14 commits into ggml-org:master from pr
hjc4869
hjc4869 Add GGML_HIP_ROCWMMA_FATTN and rocwmma header check
206d22bd
hjc4869 Add rocWMMA support
02369da4
hjc4869 hjc4869 requested a review from JohannesGaessler JohannesGaessler 1 year ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
hjc4869 Merge branch 'master' into pr
547115da
hjc4869
JohannesGaessler
hjc4869
Headcrabed
JohannesGaessler
JohannesGaessler commented on 2025-02-23
hjc4869 Update ggml/src/ggml-hip/CMakeLists.txt
419f1ea9
hjc4869 Move comments to reduce confusion.
828577a9
hjc4869 Use namespace alias `wmma` instead of lots of ifdefs.
9d27c38b
hjc4869 Fix: FP16_MMA_AVAILABLE should not be checked in host code.
19272bfa
IMbackK
Beinsezii
Headcrabed
Beinsezii
hjc4869
bjj
bjj
JohannesGaessler
bjj
bjj
hjc4869
adelj88
JohannesGaessler
JohannesGaessler commented on 2025-02-25
hjc4869 Always return false in `fp16_mma_available` when compiling for HIP an…
29debe14
hjc4869 Remove the Q->ne[1] > 8 check
5d4ab04c
hjc4869
hjc4869 Also always return false in fp16_mma_hardware_available when compiled…
55169095
JohannesGaessler
JohannesGaessler commented on 2025-02-25
bjj
hjc4869 Revert "Also always return false in fp16_mma_hardware_available when …
fea171f5
Beinsezii
bjj
IMbackK
IMbackK
IMbackK
IMbackK IMbackK assigned IMbackK IMbackK 1 year ago
bjj
IMbackK
bjj ggml: Make fattn use hardware warp size instead of 32
a90f4cb7
bjj ggml: Make fattn kernel use launch bounds w/HIP
a135b4c7
hjc4869
IMbackK
hjc4869
IMbackK
IMbackK requested changes on 2025-03-03
hjc4869 Use GGML_CUDA_CC_IS_CDNA for checking CDNA architectures.
373d48ef
hjc4869 hjc4869 requested a review from IMbackK IMbackK 1 year ago
IMbackK
IMbackK approved these changes on 2025-03-03
IMbackK
ggerganov
IMbackK
IMbackK IMbackK merged becade5d into master 1 year ago
hjc4869 hjc4869 deleted the pr branch 1 year ago
Headcrabed
IMbackK
Headcrabed
hjc4869
JohannesGaessler
JohannesGaessler commented on 2025-03-06

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone