Add conv2d Implicit GEMM #15805
Add implicit GEMM convolution operation for 2D tensors in CUDA
8a589317
Add implicit convolution support for 2D tensors in CPU and CUDA imple…
4d772873
fix passing param as reference
3877608d
Fix parameter order in conv2d_implicit and add comprehensive test cas…
6d84cbb5
Fix boundary check in conv2d_implicit_kernel to include channel limits
5ffe97be
bssrdf
marked this pull request as draft 9 days ago
Refactor conv2d_implicit_kernel for improved readability and consiste…
4b0f9d57
Refactor conv2d_implicit_kernel for improved bitwise operations; add …
83a3b7d6
merged with upstream master
735886b0
Merge branch 'master' into conv2d-implicit
2ec76aa8
Assignees
No one assigned
Labels
testing
Nvidia GPU
ggml
Login to write a write a comment.
Login via GitHub