llama.cpp
CUDA: GDN hide memory latency
#20537
Merged

am17an merged 1 commit into ggml-org:master from am17an:cuda_gdn_load2
am17an CUDA: GDN hide memory latency (0b60e3f0)
ggerganov approved these changes on 2026-03-14
github-actions added the Nvidia GPU label
github-actions added the ggml label
IMbackK
am17an merged 34818ea6 into master 22 days ago

