[MLAS/NEON] Add dedicated kernel for depthwise convolution for ARM64 using NEON intrinsics #26688
Initial commit
160c71e2
More changes
a134ea00
More changes
a44f7085
Fix builds
3a3ccf76
Fix builds 2
212dbf1e
Threaded
fceae09b
Fix x64 builds
3793d70f
Experiment
481a7f66
hariharans29
changed the title from "WIP: Conv Expt" to "[DO NOT REVIEW] WIP: Conv Expt" 204 days ago
Experiment revert
8993a0a0
Refactor
d765c1a1
More changes
a428d509
a
d53dd15b
Try
8f12c51c
More changes
67b68013
Relax padding
01b43fb6
Vanilla NEON Depthwise
ea833947
Fix indexing
dd94a3b0
Add benchmark
ffd291ad
Add lambda
92fb6045
Rework
d15bb930
hariharans29
changed the title from "[DO NOT REVIEW] WIP: Conv Expt" to "[MLAS/NEON] Add dedicated kernel for depthwise convolution for ARM64 using NEON intrinsics" 202 days ago
Update onnxruntime/test/mlas/bench/bench_sconv.cpp
119ec9a7
Fix
d0fc1431
Remove Winograd implementation
2820a842
Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp
0ffb8119
Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp
59e2b2dc
Update onnxruntime/core/mlas/lib/convolve.cpp
e34c930a
Update onnxruntime/core/mlas/inc/mlas.h
027e7429
Update onnxruntime/core/mlas/lib/sconv_nchw_kernel_neon.cpp
f93ed67a
Update onnxruntime/core/mlas/lib/convolve.cpp
f5c1b812
Benchmark updates
f15e5540
Merge remote-tracking branch 'origin/main' into hari/expt_conv
bb324b59
Merge remote-tracking branch 'origin/main' into hari/expt_conv
3c0ec53b
Merge remote-tracking branch 'origin' into hari/expt_conv
7b65a5fd
Use MLAS helpers instead of using NEON intrinsics directly
60a2b81b
Resolve comments
245f1cde
Typo
24874919
Merge remote-tracking branch 'origin' into hari/expt_conv
65bc06f4
Fix
20d2bb11
Merge remote-tracking branch 'origin' into hari/expt_conv
9c2a6546