Hotfix for litellm judge (#490)
* Made litellm judge backend more robust.
* Added failed flag to ModelResponse.
* Fixed wrong model response.
* Removed model response and replaced with string.
---------
Co-authored-by: Clémentine Fourrier <22726840+clefourrier@users.noreply.github.com>