llama.cpp
CUDA: use shared mem for ssm_conv #20128
Merged

am17an merged 5 commits into ggml-org:master from am17an:cuda_ssm_conv
am17an CUDA: use shared mem for ssm_conv
0141e9c0
am17an requested a review from ggerganov 24 days ago
am17an requested a review from JohannesGaessler 24 days ago
github-actions added the testing, Nvidia GPU and ggml labels
am17an fuse silu + ssm_conv
7ba1b0ae
am17an fuse unary + mul
de3856d4
CISC commented on 2026-03-05
am17an enable for fp16
d440e643
JohannesGaessler approved these changes on 2026-03-06
am17an formatting
43f3f54f
am17an merged 1e38a7a6 into master 23 days ago
am17an deleted the cuda_ssm_conv branch 23 days ago
IMbackK commented on 2026-03-06
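
The PR title describes moving ssm_conv's data reuse into shared memory. For context, ssm_conv is the short depthwise causal convolution used in Mamba-style SSM blocks: each output element reuses the K-1 preceding inputs of its channel, so a tile-plus-halo load into shared memory avoids redundant global reads. The sketch below is a minimal illustration of that tiling strategy, not the actual llama.cpp kernel; the names, data layout, and launch configuration are assumptions, and the fused SiLU/unary+mul paths from the later commits are omitted.

```cuda
// Hypothetical sketch of a shared-memory ssm_conv kernel (NOT the llama.cpp code).
// Semantics assumed: depthwise causal conv with kernel size K, where the input
// per channel holds K-1 leading context elements, i.e.
//   y[c][t] = sum_{k=0}^{K-1} w[c][k] * x[c][t + k],  0 <= t < n_t
template <int K, int BLOCK_T>
__global__ void ssm_conv_smem(const float * x,  // [n_ch][n_t + K - 1], assumed layout
                              const float * w,  // [n_ch][K]
                              float       * y,  // [n_ch][n_t]
                              const int     n_t) {
    const int c = blockIdx.x; // one block per channel

    __shared__ float s_x[BLOCK_T + K - 1]; // tile plus K-1 halo elements
    __shared__ float s_w[K];

    if (threadIdx.x < K) {
        s_w[threadIdx.x] = w[c*K + threadIdx.x];
    }

    for (int t0 = 0; t0 < n_t; t0 += BLOCK_T) {
        // cooperative load: each overlapping input element is read once
        for (int i = threadIdx.x; i < BLOCK_T + K - 1; i += blockDim.x) {
            const int t = t0 + i;
            s_x[i] = (t < n_t + K - 1) ? x[c*(n_t + K - 1) + t] : 0.0f;
        }
        __syncthreads();

        const int t = t0 + threadIdx.x;
        if (threadIdx.x < BLOCK_T && t < n_t) {
            float acc = 0.0f;
#pragma unroll
            for (int k = 0; k < K; ++k) {
                acc += s_w[k] * s_x[threadIdx.x + k]; // all reads hit shared mem
            }
            y[c*n_t + t] = acc;
        }
        __syncthreads(); // tile is overwritten on the next iteration
    }
}
// Assumed launch: ssm_conv_smem<4, 128><<<n_ch, 128>>>(x, w, y, n_t);
```

Without shared memory, each input element is read up to K times from global memory (once per output that overlaps it); the tile-plus-halo load reduces that to one global read per element per tile, which is presumably the motivation behind this change.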
