[pt2] Turn on cudagraph tree in fbcode (#108416)
Summary:
cudagraph tree will significantly reduce the memory usage>
Memory consumption wise: {F1081833757}
with cudagraph tree: 65GB
w/o cudagraph tree: 83GB
Differential Revision: D48907239
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108416
Approved by: https://github.com/eellison