Set use_cuda_graphs in fp8_gemm_rowwise
Summary: D64471087 changed the default value of use_cuda_graphs to False, which slowed down the Triton and CK kernels for fp8_gemm_rowwise. This change sets use_cuda_graphs explicitly in fp8_gemm_rowwise.
Reviewed By: danzimm
Differential Revision: D65140285
fbshipit-source-id: 4ab77537afeb9108dab7cdef6cac34aaa39d7d73