onnxruntime
mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
#27099

Merged

hariharans29 merged 11 commits into microsoft:main from milpuz01:aarch64_convolutions

Rohanjames1997 commented on 2026-01-23

hariharans29 commented on 2026-01-23

hariharans29 requested a review from copilot-pull-request-reviewer 136 days ago

copilot-pull-request-reviewer commented on 2026-01-26

milpuz01 requested a review from hariharans29 127 days ago

milpuz01 requested a review from Rohanjames1997 127 days ago

Rohanjames1997 commented on 2026-02-04

hariharans29 requested a review from copilot-pull-request-reviewer 125 days ago

copilot-pull-request-reviewer commented on 2026-02-06

hariharans29 commented on 2026-02-06

milpuz01 requested a review from hariharans29 122 days ago

hariharans29 commented on 2026-02-09

hariharans29 requested a review from copilot-pull-request-reviewer 122 days ago

copilot-pull-request-reviewer commented on 2026-02-09

milpuz01 requested a review from hariharans29 120 days ago

mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection

2e0524ec

Address comments from the reviewers

c7ee53ca

webgpu: optimize Gemm and MatMul using subgroup feature (#26433)

4a266fd3

[QNN-EP] Implement file mapped weights feature (#26952)

e48c6378

[WebGPU EP] Reduce duplicated code in `MatMulReadFnSource()` (#27151)

ed5ffe90

mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection

378e7cb9

Address the comments from reviewers, fix failing tests and reduce sta…

a61fd540

Update qnn_backend_manager.h

c0946f12

Address comments from reviewers

71fa09f8

Move comment to more appropriate place

bd38b0e2

milpuz01 force pushed from 2d058537 to bd38b0e2 119 days ago

Fix bad meerge

49b58749

hariharans29 approved these changes on 2026-02-13

hariharans29 enabled auto-merge (squash) 119 days ago

hariharans29 merged bd8f781f into main 118 days ago

milpuz01 deleted the aarch64_convolutions branch 118 days ago

Reviewers

hariharans29

copilot-pull-request-reviewer

Rohanjames1997

