cuda: refactored ssm_scan and use CUB #13291
cuda: refactored ssm_scan to use CUB
b2f8eea9
fixed compilation error when when not using CUB
c7d4d45f
Your-Cheese
force pushed
from
3a454c91
to
c7d4d45f
159 days ago
assign L to constant and use size_t instead of int
949e4fa2
deduplicated functions
75520d67
change min blocks per mp to 1
7e559f3e
Use cub load and store warp transpose
7d259d9e
Merge https://github.com/ggml-org/llama.cpp into ssm_scan_cub
ae519a48
IMbackK
dismissed these changes
on 2025-08-06
suppress clang warning
dd6ff8e5
Assignees
No one assigned