llama.cpp
CUDA: add conv_2d_dw
#14265
Merged

am17an merged 4 commits into ggml-org:master from am17an:add_conv2d_dw_cuda
github-actions added the Nvidia GPU label
github-actions added the ggml label
ggerganov requested a review from JohannesGaessler 338 days ago
JohannesGaessler commented on 2025-06-19
am17an CUDA: add conv_2d_dw
f177231c
am17an better naming
6eb7fbbe
am17an simplify using template
2c60d2cc
am17an Review: fix operation ordering in ggml-cuda, use __forceinline__, use…
d64ba79d
am17an force-pushed to d64ba79d 337 days ago
Acly
JohannesGaessler approved these changes on 2025-06-19
am17an merged 9eaa51e7 into master 337 days ago
am17an deleted the add_conv2d_dw_cuda branch 330 days ago
