[mosaic_gpu] Make CUPTI finalization optional.
CUPTI initialization and finalization are somewhat expensive. Making finalization optional lets callers avoid repeated initialization when performing multiple CUPTI timings. Also disable kernel activity so that CUPTI is restored to its original state.
PiperOrigin-RevId: 738685851