Enable Lowering Channels last Conv1x1 when max autotune is set (#107004)
This can lead to a large speedup when max autotune is set, e.g. resnet 2.1x -> 2.5x, particularly in combination with freezing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107004
Approved by: https://github.com/jansel, https://github.com/shunting314, https://github.com/int3
ghstack dependencies: #106911, #106912