Update copy_ op
Note that we lower both `copy` and `copy_` ops. When we only lower
`copy` op, the in-place version `copy_` op gets auto-generated. However,
upstream PyTorch had an assertion that this auto-generated was wrong.
To by pass, we're manually lowering `copy_` as well.