onnxruntime
[CPU/CUDA EP] Add DeformConv op support
#27393
Merged
tianleiwu
merged 58 commits into
microsoft:main
from
ShirasawaSama:feature/add-deform-conv-2d-support
ShirasawaSama
changed the title from "Feature/add deform conv 2d support" to "Add deform conv 2d support"
87 days ago
fs-eire
requested a review
from
copilot-pull-request-reviewer
86 days ago
copilot-pull-request-reviewer
commented on 2026-02-20
ShirasawaSama
changed the title from "Add deform conv 2d support" to "[CPU/CUDA EP] Add DeformConv op support"
83 days ago
ShirasawaSama
force pushed
from
2d85c093
to
bb17da52
80 days ago
github-advanced-security
commented on 2026-02-28
tianleiwu
commented on 2026-02-28
ShirasawaSama
force pushed
from
bb17da52
to
7d2b779e
77 days ago
ShirasawaSama
marked this pull request as draft
77 days ago
ShirasawaSama
force pushed
from
1e5babad
to
1222ad4a
77 days ago
github-advanced-security
commented on 2026-03-01
ShirasawaSama
marked this pull request as ready for review
75 days ago
ShirasawaSama
requested a review
from
tianleiwu
75 days ago
github-advanced-security
commented on 2026-03-05
ShirasawaSama
marked this pull request as draft
73 days ago
ShirasawaSama
marked this pull request as ready for review
71 days ago
ShirasawaSama
force pushed
from
9d7d29dc
to
cbe1eca8
71 days ago
tianleiwu
dismissed these changes on 2026-03-11
tianleiwu
requested a review
from
copilot-pull-request-reviewer
67 days ago
tianleiwu
dismissed their stale review
67 days ago
please address remaining issues
copilot-pull-request-reviewer
commented on 2026-03-11
ShirasawaSama
marked this pull request as draft
66 days ago
ShirasawaSama
marked this pull request as ready for review
66 days ago
ShirasawaSama
force pushed
from
7adfb05a
to
c5d86e71
64 days ago
tianleiwu
requested a review
from
copilot-pull-request-reviewer
62 days ago
copilot-pull-request-reviewer
commented on 2026-03-16
tianleiwu
dismissed these changes on 2026-03-16
ShirasawaSama
dismissed their stale review via
d7127544
62 days ago
tianleiwu
dismissed these changes on 2026-03-17
tianleiwu
enabled auto-merge (squash)
60 days ago
auto-merge was disabled automatically
60 days ago
Reason: head branch was pushed to by a user without write access
ShirasawaSama
dismissed their stale review via
454eea1d
60 days ago
Add deform conv 2d cpu execution provider support
4790b094
Add more tests
abfec39e
Add cuda support for deformconv2d
a0c50604
Improve deformconv cuda pref
dd8e7f1c
Add more test cases
c5bd48af
Fix copilot suggestions
952b3a12
Fix default attrs value of DeformConv
e5c043c6
Fix schema definition for DeformConv op
eee517da
Refactor DeformConv test cases
12b19c8a
Fix OrtMemTypeCPUInput issue and add cuda error check
d6c19be5
Remove GemmEx double specialization
12fd042b
Fix potential integer overflow in CUDA DeformableIm2ColKernel
9b069e33
Optimize CPU DeformableIm2Col loop order for better cache locality
cbadf131
Parallelize CPU DeformConv Im2Col and bias addition
a9515683
Use GPU free memory in DeformConv temp memory heuristic
f1a98325
Extract DeformConvAttributes to shared header
d99994ff
DeformConv op shared attributes and validation
7d7f66ea
Refactor attributes/validation and optimize CUDA DeformConvIm2Col kernel
8b5a13f5
Add DeformConv OnnxModelTest with reference ONNX model
e5ec6def
Optimize GetGreatestDivisorBelowBound in CUDA DeformConv
14cf455d
Document symmetric-padding-only limitation in deform_conv_test_gen
4121178a
Skip cuda DeformConv op copy kernel when cur_parallel==1
df9d0b10
Reformat code
03cc5e5b
Fix cuda fp16 test cases
d9f65fb7
Fix int64_t to ptrdiff_t conversion in deform_conv
15fe856f
Resolve pipeline failures caused by unit tests
931c3862
Add comments and handle unused variables
fedd3898
Address review feedback and align with Conv behavior
7ebc4988
Optimize DeformConv cpu bias add with Eigen SIMD
0479aded
Document GEMM layout trick in DeformConv cuBLAS path
f7819f14
Use int64_t for bilinear interpolation indices
34fae7d1
refactor(DeformConv CPU): template UseMask and improve im2col perform…
173fd6be
perf(DeformConv CPU): optimize im2col and BilinearInterpolate
33e4866b
Early OOB check for BilinearInterpolate
a482eb5e
Shrink DeformConv CUDA mutex to UpdateState only
b46f922c
Use cublasGemmStridedBatched for gemm_writes_directly path in DeformC…
82d12283
Fix var name
da18ee30
Drop mask==0 branch in im2col to match CPU behavior
dcd00c30
Fix C4244 in deform_conv_op_test by casting rtol/atol to float
f2d8f5df
Use cached totalGlobalMem for temp budget, remove cudaMemGetInfo and …
a25c1f4a
Document int indices in CUDA BilinearInterpolate
d98f3399
Document prime-batch fallback to single-image chunks in DeformConv Ge…
c2bf6f40
Refine deform conv test generator imports and ONNX model save usage
a8920b42
Optimize DeformConv CPU bilinear interpolation
b7b46813
Enforce 2D attribute lengths and validate kernel_shape/pads/overflow-…
46f176c4
Clarify DeformConv OnnxModelTest comment as ORT-reference smoke test
7cd167b2
DeformConv EmptyBatch test expects failure when batch size N is zero
ada5ca39
Allow DeformConv empty batch
17b155ac
Update docs
288e4c02
ShirasawaSama
force pushed
from
454eea1d
to
288e4c02
55 days ago
tianleiwu
enabled auto-merge (squash)
55 days ago
tianleiwu
approved these changes on 2026-03-23
tianleiwu
merged
163f6149
into main
55 days ago
Reviewers
tianleiwu
copilot-pull-request-reviewer
github-advanced-security