llama.cpp
llama : enable chunked fused GDN path
#20340
Merged

llama : enable chunked fused GDN path #20340

ggerganov merged 9 commits into master from gg/llama-allow-gdn-ch
ggerganov
ggerganov llama : enable chunked fused GDN path
ec2443a9
ggerganov ggerganov requested a review from CISC CISC 8 days ago
ggerganov ggerganov requested a review from am17an am17an 8 days ago
github-actions github-actions added model
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
am17an
am17an approved these changes on 2026-03-10
ggerganov
am17an
ggerganov
am17an
ggerganov
am17an
ggerganov models : avoid Q and K repeats when using fused GDA
39b6f5a7
ggerganov ggerganov force pushed from 444eeed6 to 39b6f5a7 8 days ago
ggerganov
am17an
am17an commented on 2026-03-10
ggerganov cont : fix comment
79541c0a
am17an
am17an commented on 2026-03-10
ggerganov cont : fix the fix
46c693dc
ggerganov cont : fix
c6b76caf
CISC
CISC commented on 2026-03-10
ymcki
am17an
ProgenyAlpha
am17an
ProgenyAlpha
ggerganov metal : add GDN kernel (#20361)
f6a0c16e
github-actions github-actions added Apple Metal
ORippler CUDA: AR gated delta net improvements (#20391)
d1b2301f
github-actions github-actions added testing
ggerganov Merge branch 'master' into gg/llama-allow-gdn-ch
1baffbcb
ggerganov llama : refactor llm_build_delta_net_base API
bac4f0f8
CISC
ggerganov
lhez
CISC
ggerganov
ProgenyAlpha
CISC
ggerganov ggerganov merged d28961d8 into master 7 days ago
ggerganov ggerganov deleted the gg/llama-allow-gdn-ch branch 7 days ago
CISC
sultanqasim
ProgenyAlpha
am17an
ProgenyAlpha
am17an
ggml-org ggml-org locked as too heated and limited conversation to collaborators 6 days ago
ggml-org ggml-org unlocked this conversation 6 days ago
ProgenyAlpha

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone