Add kernels submodule (#2380)
Summary:
`triton.ops` has been deprecated. We are now importing kernels from triton-lang/kernels submodule.
Since there is a bug in triton-lang/kernels (https://github.com/xuzhao9/kernels/commit/c907d8ec7aa46d7c722dc06f9020e91b346cafbd), we are using submodules in a downstream fork.
Pull Request resolved: https://github.com/pytorch/benchmark/pull/2380
Reviewed By: manman-ren, sijiac
Differential Revision: D59868848
Pulled By: xuzhao9
fbshipit-source-id: e361e6ba2b80802f6c3242773c4dd4a5af6356da