llama.cpp
CANN: support gated linear attn
#18653
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
3
Changes
View On
GitHub
CANN: support gated linear attn
#18653
hipudding
merged 3 commits into
ggml-org:master
from
hipudding:gla
hipudding
marked this pull request as draft
131 days ago
github-actions
added
ggml
github-actions
added
Ascend NPU
hipudding
force pushed
from
7c21f715
to
81f18f3c
131 days ago
hipudding
force pushed
from
81f18f3c
to
746a693d
131 days ago
hipudding
marked this pull request as ready for review
131 days ago
hipudding
requested a review
from
noemotiovon
131 days ago
noemotiovon
approved these changes on 2026-01-07
hipudding
requested a review
from
noemotiovon
125 days ago
hipudding
requested a review
from
ggerganov
125 days ago
noemotiovon
approved these changes on 2026-01-13
CANN: support gated linear attn
4f81fc99
CANN: optimize OP gla
0809aa59
Remove unused comments
bfa67a8a
hipudding
force pushed
from
903d0ce8
to
bfa67a8a
122 days ago
ggerganov
approved these changes on 2026-01-16
hipudding
merged
baa4ba0a
into master
122 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
ggerganov
noemotiovon
Assignees
No one assigned
Labels
ggml
Ascend NPU
Milestone
No milestone
Login to write a write a comment.
Login via GitHub