[functorch] Minor improvements for _autograd_grad (pytorch/functorch#750)
I was really annoyed that we preallocate result tensors for
everything and then throw most of them out. The new code
variant doesn't do that.
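
For illustration only, here is a rough sketch of the difference between the two patterns. It is not the functorch code: plain Python lists stand in for tensors, `None` stands in for a gradient that was not produced, and `zeros_like`, `fill_eager`, and `fill_lazy` are hypothetical names invented for this example.

```python
# Illustrative sketch only: lists stand in for tensors, and the helper
# names below are hypothetical, not functorch or PyTorch functions.

def zeros_like(value, counter):
    """Allocate a zero-filled placeholder and count the allocation."""
    counter[0] += 1
    return [0.0] * len(value)


def fill_eager(inputs, grads, counter):
    """Old pattern: preallocate a zero result for every input, then overwrite."""
    results = [zeros_like(inp, counter) for inp in inputs]
    for i, grad in enumerate(grads):
        if grad is not None:
            results[i] = grad  # the preallocated placeholder is thrown away
    return tuple(results)


def fill_lazy(inputs, grads, counter):
    """New pattern: allocate zeros only where no gradient was produced."""
    return tuple(
        zeros_like(inp, counter) if grad is None else grad
        for inp, grad in zip(inputs, grads)
    )


inputs = [[1.0, 2.0], [3.0], [4.0, 5.0, 6.0]]
grads = [[0.5, 0.5], None, [1.0, 1.0, 1.0]]  # None marks a missing gradient

eager_allocs, lazy_allocs = [0], [0]
eager = fill_eager(inputs, grads, eager_allocs)
lazy = fill_lazy(inputs, grads, lazy_allocs)
```

Both variants return the same results; the lazy one only pays for a placeholder when a gradient is actually missing.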
Signed-off-by: Edward Z. Yang <ezyang@fb.com>