llama.cpp
[CANN]: add the basic supports of Flash Attention kernel
#13627
Merged

[CANN]: add the basic supports of Flash Attention kernel #13627

shibizhao
shibizhao cann: add the basic FA support
72df31df
shibizhao cann: update the readme
3a731825
shibizhao cann: update the FlashAttention with PSEShift
6a39d638
shibizhao cann: update the input parameters in FA
8a902b98
shibizhao cann: update the alibi with max_bias
f5e24a5c
shibizhao cann: add the constrints of softcap
c8c2908b
shibizhao cann: update the docs CANN.md
47f2c646
shibizhao cann: update the docs CANN.md
fb62f015
github-actions github-actions added documentation
github-actions github-actions added ggml
shibizhao shibizhao changed the title cann: add the basic supports of Flash Attention kernel [CANN]: add the basic supports of Flash Attention kernel 218 days ago
hipudding hipudding requested a review from hipudding hipudding 218 days ago
hipudding hipudding added Ascend NPU
shibizhao
hipudding
shibizhao
noemotiovon
noemotiovon commented on 2025-05-21
shibizhao cann: fix typo of CANN.md
b266beb2
shibizhao cann: add some comments and update the CANN.md
8a112f0a
shibizhao cann: update the CANN.md
1779e008
shibizhao
noemotiovon
noemotiovon commented on 2025-05-21
shibizhao cann: update the inner precise for fusedInferAttention
092ccf68
shibizhao
noemotiovon
noemotiovon commented on 2025-05-22
shibizhao cann: update the constraints of flash_attn_ext on ggml-cann.cpp
c380305b
shibizhao
noemotiovon
hipudding
shibizhao cann: resolve the conflict with laster master branch
89f884e6
shibizhao
shibizhao Merge branch 'master' into flash-attn-cann
1a3bfecb
shibizhao cann: clean the whitespace
3b084d5b
shibizhao cann: clean the whitespace
d23697b8
shibizhao cann: add a new endline
8a7829b7
hipudding
hipudding approved these changes on 2025-05-26
hipudding
hipudding hipudding merged 2d38b6e4 into master 212 days ago
shibizhao shibizhao deleted the flash-attn-cann branch 212 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone