change block codegen to handle new inlining in NNC (#47687)
Summary:
Minor changes to block codegen to handle the new inlining in NNC.
For Block code generation, inlining must be delayed until after the dimension data for the tensors has been collected.
We need the dimensions of the tensors as they were before flattening. This information is no longer available after the inlining pass, so for Block we run inlining only after collecting this data with the `CreateBufferMap` analysis.
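The ordering constraint can be illustrated with a toy model. This is only a sketch in Python, not the NNC C++ API: `Buf`, `create_buffer_map`, `inline_and_flatten`, and `prepare_for_block` are hypothetical stand-ins, and the flattening behaviour is assumed from the description above.

```python
from dataclasses import dataclass
from math import prod


@dataclass
class Buf:
    """Toy tensor buffer: a name plus its dimensions."""
    name: str
    dims: tuple


def create_buffer_map(bufs):
    # Hypothetical stand-in for the CreateBufferMap analysis:
    # record the dimensions each buffer has at the time of the call.
    return {b.name: b.dims for b in bufs}


def inline_and_flatten(bufs):
    # Toy stand-in for the inlining pass (assumption): every buffer
    # comes out flattened to a single dimension, so its shape is lost.
    return [Buf(b.name, (prod(b.dims),)) for b in bufs]


def prepare_for_block(bufs):
    # 1. Collect the multi-dimensional shapes first.
    dim_map = create_buffer_map(bufs)
    # 2. Only then run inlining, which discards those shapes.
    lowered = inline_and_flatten(bufs)
    return dim_map, lowered
```

Running the analysis on the output of `inline_and_flatten` instead would only see one-dimensional buffers, which is why the order matters for Block.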
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47687
Reviewed By: ZolotukhinM
Differential Revision: D24864869
Pulled By: protonu
fbshipit-source-id: 9574c0599f7d959a1cf0eb49d4e3e541cbe9b1d3