lighteval
c7a063ae - Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark (#983)

Commit
125 days ago
Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark (#983) * add slr_bench evals function * implement feedback on PR * remove logging and raise exception when judge not loaded
Author
Parents
Loading