refactor: upgrade to lightning 2
this requires the vocoder to switch to manual optimization, so the trainer's gradient_clip_val can no longer be set by default. this commit updates the requirements and the helper accordingly, and sets gradient_clip_val=1.0 for the feature prediction network.