[JIT] make RegisterCudaFuseGraph use TORCH_API instead of C10_EXPORT (#73742)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73742
I think this was causing multiple implementations of
RegisterCudaFuseGraph on windows. It looks like both torch_cpu.dll and
torch_python.dll were exporting RegisterCudaFuseGraph implementations,
so RegisterCudaFuseGraph::isRegistered() would refer to a _different_
static bool depending on whether the caller was in torch_cpu.dll or
torch_python.dll. See #73717 for a demonstration.
Test Plan: Imported from OSS
Reviewed By: eellison
Differential Revision: D34618623
Pulled By: davidberard98
fbshipit-source-id: f9bc4fe792098bfabf50ecfcd1f785ed039184bd
(cherry picked from commit 2de58679342914f456d63a9f952357ac163cc4ec)