llama.cpp
CUDA: Conv2d Tensor Core
#15813
Open

CUDA: Conv2d Tensor Core #15813

mnehete32 wants to merge 3 commits into ggml-org:master from mnehete32:conv2d_tensor_core
mnehete32
mnehete32 CUDA: cov2d with tensor core
19596b17
mnehete32 CUDA: conv2d added comment
96db6275
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
mnehete32 CUDA: conv2d support fp16 without wmma
2cd9fb0f
mnehete32 mnehete32 force pushed from 57aa09e0 to 2cd9fb0f 5 days ago
JohannesGaessler
JohannesGaessler commented on 2025-09-05
Green-Sky
JohannesGaessler
ggerganov
Green-Sky
mnehete32
mnehete32
JohannesGaessler
mnehete32
JohannesGaessler

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone