Wrap Caffe2 (RowWise)SparseAdagrad fusion operator as a PT op (#38704)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38704
This diff wraps Caffe2's (RowWise)SparseAdagrad fusion operator on GPU as a PT op.
Reviewed By: jianyuh, xw285cornell
Differential Revision: D21511611
fbshipit-source-id: 1a0bb8252ec0a8229eb80708338cb23008cfb26d