[Inductor] Optimize read write merging in FusedSchedulerNode ctor (#105693)
This reduces optimizer compilation time by half, and I expect it to improve compilation time in general as well.
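
For illustration only, here is a minimal sketch of the general idea, not the code from this PR. It assumes the speedup comes from merging all read/write sets in one pass instead of folding them pairwise, which allocates a new intermediate object per node. The `ReadWrites` class below is a hypothetical stand-in, and its `merge` and `merge_list` methods are assumptions made for the example.

```python
from dataclasses import dataclass
from functools import reduce


@dataclass(frozen=True)
class ReadWrites:
    """Hypothetical stand-in for a node's read/write dependency sets."""

    reads: frozenset
    writes: frozenset

    def merge(self, other):
        # Pairwise merge: builds new sets on every call, so folding this
        # over N nodes creates N - 1 intermediate objects.
        writes = self.writes | other.writes
        reads = (self.reads | other.reads) - writes
        return ReadWrites(reads, writes)

    @staticmethod
    def merge_list(items):
        # Single-pass merge: one union per field, then one difference.
        writes = frozenset().union(*(rw.writes for rw in items))
        reads = frozenset().union(*(rw.reads for rw in items)) - writes
        return ReadWrites(reads, writes)


nodes = [
    ReadWrites(frozenset({"a", "b"}), frozenset({"c"})),
    ReadWrites(frozenset({"c", "d"}), frozenset({"e"})),
    ReadWrites(frozenset({"e", "f"}), frozenset({"g"})),
]

pairwise = reduce(ReadWrites.merge, nodes)
merged = ReadWrites.merge_list(nodes)
```

In this sketch both approaches produce the same result; the single-pass version just avoids rebuilding the sets once per fused node.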
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105693
Approved by: https://github.com/jansel