Add batch-invariant accuracy mode for benchmark perf tests (#180610)
Summary:
Adds a new `--batch-invariant` accuracy mode to the dynamo benchmark harness and wires it into the nightly H100 inductor dashboard.
For a model with full batch size N, the mode runs the compiled forward at N, N/2, N/4, ..., 1, and for each smaller size checks that the output is bitwise equal to the full-batch reference sliced down to that size. A
The plan is to land this first so we have continuous dashboard signal, then drive model pass-rates up by fixing the underlying inductor issues model-by-model.
CI:
- `.ci/pytorch/test.sh` dispatches the accuracy run whenever `DASHBOARD_TAG` contains `batch_invariant_accuracy-true`.
X-link: https://github.com/pytorch/pytorch/pull/180610
Approved by: https://github.com/jansel
Reviewed By: wdvr
Differential Revision: D101732433
fbshipit-source-id: 7bf17aaaf16acfd66cd035c4d3d26c2ebcb8ad2d
Co-authored-by: Claude