fix(Phi4Multimodal): Fix incorrect default vision/audio config initialization in Phi4MultimodalConfig (#43480)
* fix(config): Ensure default instantiation of vision and audio configurations in Phi4MultimodalConfig
* fix(config): Ensure default instantiation of audio configuration in Phi4MultimodalConfig
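
For illustration only, the general pattern these commits describe (building default sub-configs when none are passed) might look like the sketch below. The classes `VisionConfig`, `AudioConfig`, and `MultimodalConfig`, the `_resolve` helper, and the `hidden_size` values are hypothetical stand-ins, not the actual `Phi4MultimodalConfig` code changed in this PR.

```python
# Hypothetical stand-ins for illustration; not the real transformers classes.
class VisionConfig:
    def __init__(self, hidden_size=8):  # arbitrary placeholder default
        self.hidden_size = hidden_size


class AudioConfig:
    def __init__(self, hidden_size=16):  # arbitrary placeholder default
        self.hidden_size = hidden_size


class MultimodalConfig:
    def __init__(self, vision_config=None, audio_config=None):
        # Each sub-config is resolved independently, so omitting one
        # never leaves it as None or as a raw dict.
        self.vision_config = self._resolve(vision_config, VisionConfig)
        self.audio_config = self._resolve(audio_config, AudioConfig)

    @staticmethod
    def _resolve(value, config_cls):
        if value is None:
            # Nothing passed: fall back to a default-constructed sub-config.
            return config_cls()
        if isinstance(value, dict):
            # A plain dict (e.g. loaded from JSON) becomes a config object.
            return config_cls(**value)
        # Already a config object: use it unchanged.
        return value


config = MultimodalConfig()
print(type(config.vision_config).__name__, type(config.audio_config).__name__)
```

With this shape, `MultimodalConfig()` and `MultimodalConfig(vision_config={"hidden_size": 32})` both yield fully typed sub-configs, which is the behavior the commit subjects above aim for.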