onnxruntime
NEON kernels for NCHWc Convolution and Pooling
#25580
Merged

Rohanjames1997 Rewire ORT to support a NEON version of NCHWc Conv
cc30a146
Rohanjames1997 Remove reference to assembly file
f190c0d7
Rohanjames1997 Add a NEON kernel for Pointwise Convolution
632870bb
Rohanjames1997 Add a NEON kernel for Depthwise
159570a9
Rohanjames1997 Remove placeholder implementations
52f09bf4
Rohanjames1997 Add placeholder kernel for MlasConvNchwcFloatKernelNeon
b505bd64
Rohanjames1997 Fix MlasConvNchwcFloatKernelNeon
790cc7ed
Rohanjames1997 Use MLAS intrinsics for MlasConvNchwcFloatKernelNeon
906393af
Rohanjames1997 Add MlasConvNchwFloatKernelNeon
4d322e64
Rohanjames1997 Add placeholder NCHWc Pool
cb06a1a4
Rohanjames1997 Vanilla C++ implementation
00caa4c1
Rohanjames1997 Intrinsics for Pooling
4cead5ec
Rohanjames1997 Refactored to share code
abd54916
Rohanjames1997 Format file & delete unused header
74e0e3b0
Rohanjames1997 Minor modifications to pass more tests
16be947d
Rohanjames1997 Remove unnecessary code & formatting changes
f7d971d3
Rohanjames1997 Refactor to share some code
0ff394cd
Rohanjames1997 Change block size to 16
bd2b6c44
Rohanjames1997 Update pooling algorithm for block size 16
2b783776
Rohanjames1997 Remove comment
ee9b9431
hariharans29 closed this 107 days ago
hariharans29 reopened this 107 days ago
hariharans29 requested a review from copilot-pull-request-reviewer 107 days ago
copilot-pull-request-reviewer commented on 2025-09-05
hariharans29 commented on 2025-09-05
hariharans29 commented on 2025-09-05
hariharans29 commented on 2025-09-05
Rohanjames1997 Add correct header and refactor kernels to share code.
23425e8e
hariharans29 commented on 2025-09-09
Rohanjames1997 Address Copilot comments
7000e9fe
Rohanjames1997 Extend kernels to Windows & Apple
c5c3f051
Rohanjames1997 Merge remote-tracking branch 'upstream/main' into nchwc_conv_pool
619d87c2
Rohanjames1997 Hardcode BlockSize to 16 and add it to the header.
506bf053
hariharans29 commented on 2025-09-12
hariharans29 commented on 2025-09-12
hariharans29 commented on 2025-09-12
edgchen1 commented on 2025-09-12
Rohanjames1997 Increase android build size to 10% higher than the CI-reported size o…
fb5fb504
Rohanjames1997 Centralize MLAS_NEON_NCHWC_BLOCK_SIZE
fb99f7dc
Rohanjames1997 Merge branch 'microsoft:main' into nchwc_conv_pool
aa21aca3
hariharans29 closed this 99 days ago
hariharans29 reopened this 99 days ago
hariharans29 closed this 96 days ago
hariharans29 reopened this 96 days ago
hariharans29 approved these changes on 2025-09-15
hariharans29 merged 2d2a3e57 into main 96 days ago
hariharans29 commented on 2025-09-17
hariharans29 commented on 2025-09-17
