mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection #27099
mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
2e0524ec
Address comments from the reviewers
c7ee53ca
webgpu: optimize Gemm and MatMul using subgroup feature (#26433)
4a266fd3
[QNN-EP] Implement file mapped weights feature (#26952)
e48c6378
[WebGPU EP] Reduce duplicated code in `MatMulReadFnSource()` (#27151)
ed5ffe90
mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
378e7cb9
Address the comments from reviewers, fix failing tests and reduce sta…
a61fd540
Update qnn_backend_manager.h
c0946f12
Address comments from reviewers
71fa09f8
Move comment to more appropriate place
bd38b0e2
milpuz01
force pushed
from
2d058537
to
bd38b0e2
119 days ago
Fix bad meerge
49b58749
milpuz01
deleted the aarch64_convolutions branch 118 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub