adding tolerance for numeric test of checkpointing (#9404)
There was initially concern that the numerics should be exact between activation remat and not. The rematerialized activation should be precise, however, the XLA compiler may re-order the ops, so the final update may deviate slightly, and the final loss of the model could vary even more than that.