llama.cpp
CANN: support gated linear attn
#18653
Merged

CANN: support gated linear attn #18653

hipudding merged 3 commits into ggml-org:master from hipudding:gla
hipudding
hipudding hipudding marked this pull request as draft 131 days ago
github-actions github-actions added ggml
github-actions github-actions added Ascend NPU
hipudding hipudding force pushed from 7c21f715 to 81f18f3c 131 days ago
hipudding hipudding force pushed from 81f18f3c to 746a693d 131 days ago
hipudding
hipudding hipudding marked this pull request as ready for review 131 days ago
hipudding hipudding requested a review from noemotiovon noemotiovon 131 days ago
noemotiovon
noemotiovon approved these changes on 2026-01-07
hipudding hipudding requested a review from noemotiovon noemotiovon 125 days ago
hipudding hipudding requested a review from ggerganov ggerganov 125 days ago
noemotiovon
noemotiovon approved these changes on 2026-01-13
CANN: support gated linear attn
4f81fc99
hipudding CANN: optimize OP gla
0809aa59
hipudding Remove unused comments
bfa67a8a
hipudding hipudding force pushed from 903d0ce8 to bfa67a8a 122 days ago
ggerganov
ggerganov approved these changes on 2026-01-16
hipudding
hipudding hipudding merged baa4ba0a into master 122 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone