onnxruntime
mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
#27099
Merged

Commits
  • mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
    milpuz01 committed 119 days ago
  • Address comments from the reviewers
    milpuz01 committed 119 days ago
  • webgpu: optimize Gemm and MatMul using subgroup feature (#26433)
    milpuz01 committed 119 days ago
  • [QNN-EP] Implement file mapped weights feature (#26952)
    milpuz01 committed 119 days ago
  • [WebGPU EP] Reduce duplicated code in `MatMulReadFnSource()` (#27151)
    milpuz01 committed 119 days ago
  • mlas/arm64: add NEON conv asm kernels and tune NCHWC kernel selection
    milpuz01 committed 119 days ago
  • Address the comments from reviewers, fix failing tests and reduce stack spill
    milpuz01 committed 119 days ago
  • Update qnn_backend_manager.h
    milpuz01 committed 119 days ago
  • Address comments from reviewers
    milpuz01 committed 119 days ago
  • Move comment to more appropriate place
    milpuz01 committed 119 days ago
  • Fix bad meerge
    milpuz01 committed 119 days ago
Loading