reduce overhead in split and chunk for NestedTensor (#108213)
GH first copy of #108207
Uses raw pointers to reduce construction overhead.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108213
Approved by: https://github.com/dracifer, https://github.com/jbschlosser