lighteval
Fixing naming for sample evals + adding reqs in aime24
#989
Merged
