lighteval
Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark
#983
Merged

Loading