llama.cpp
ggml : optimize cuda ssm_scan using warp-level reduction
#18505
Merged

ggml : optimize cuda ssm_scan using warp-level reduction #18505

Aadeshveer
Aadeshveer ggml : optimize cuda ssm_scan using warp-level reduction
64b08fed
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
jeffbolznv
Aadeshveer
am17an
Aadeshveer
pwilkin
gabe-l-hart
gabe-l-hart
gabe-l-hart approved these changes on 2026-01-05
Aadeshveer Aadeshveer force pushed from 53805208 to 0370ef9c 52 days ago
Aadeshveer
Aadeshveer Aadeshveer requested a review from gabe-l-hart gabe-l-hart 52 days ago
am17an
am17an approved these changes on 2026-01-06
Aadeshveer ggml : apply code review suggestions (style, const, constexpr)
67f2d003
Aadeshveer Aadeshveer force pushed from 0370ef9c to 67f2d003 52 days ago
Aadeshveer Aadeshveer requested a review from am17an am17an 52 days ago
Aadeshveer ggml : add TODO regarding stride consistency
4c8271c5
am17an am17an merged 24af22fc into master 52 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone