lighteval
7028af37 - Add custom tasks for evaluation of french models (#505)

Commit
328 days ago
Add custom tasks for evaluation of french models (#505) * Add tasks for benchmark of french models * Remove duplicated code, metric imported from ifeval main file * Remove 'loglikelihood single token' for running GPQA with vllm * Change subset for gpqa-fr task
Author
Parents
Loading