lighteval
c7a063ae
- Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark (#983)
Commit
125 days ago
Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark (#983)

* add slr_bench evals function
* implement feedback on PR
* remove logging and raise exception when judge not loaded
References
#983 - Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark
Author
Ahmad21Omar
Parents
5137e03f