benchmark
2b283c94 - Skip slow tests for aarch64-inductor-benchmarks (#158842)

Commit

219 days ago

Skip slow tests for aarch64-inductor-benchmarks (#158842) Summary: This PR suggests adding some models to `cpu_skip_list` which are currently being run in TIMM and Torchbench. The suggested models takes a long time which leads to the benchmark runs being `timeout`. [benchmark runs for aarch64](https://github.com/pytorch/pytorch/actions/workflows/inductor-perf-test-nightly-aarch64.yml) • The issue stems from unoptimized groupwise convolution (BF16 /F16 dtype) kernels for aarch64 platforms , which significantly slow down execution leading to the timeout. **Action:** • An optimized BF16 groupwise convolution kernel is currently being developed in oneDNN, targeted for release in Q4 2025. To maintain dashboard consistency and signal clarity, I’ve skipped the affected tests in: * timm benchmarks * torchbench benchmarks As suggested, skip is applied at the CPU - arch level, explicitly branching for aarch64 and adding models which needs to be skipped. This keeps the logic clean, but: • An alternative considered was increasing shard counts for aarch64 runners, but given the known performance bottleneck, skipping avoids wasted compute cycles. Suggestions around this will be appreciated. Benchmark does not timeout after the suggested change: https://github.com/pytorch/pytorch/actions/runs/16447200138 X-link: https://github.com/pytorch/pytorch/pull/158842 Approved by: https://github.com/malfet Reviewed By: ZainRizvi Differential Revision: D78874148 fbshipit-source-id: 1df281c51e613673852836a2546fdc9aa731932e

Author

generatedunixname499836121

Committer

facebook-github-bot

Parents

4fd824e4

benchmark 2b283c94 - Skip slow tests for aarch64-inductor-benchmarks (#158842)

benchmark
2b283c94 - Skip slow tests for aarch64-inductor-benchmarks (#158842)