Fix decomposition for std (#87181)
The previous implementation was lacking a few features and incurred on a
pretty large error
cc @ezyang @mruberry @ngimel @Lezcano @fdrocha
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87181
Approved by: https://github.com/ngimel, https://github.com/peterbell10