lighteval
Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark
#983
Merged

Adding New Task SLR-Bench as a Community Task : Scalable Logical Reasoning Benchmark #983

NathanHB merged 3 commits into huggingface:main from Ahmad21Omar:slr-bench
Ahmad21Omar
Ahmad21Omar add slr_bench evals function
512449bb
HuggingFaceDocBuilderDev
NathanHB
NathanHB commented on 2025-09-23
NathanHB NathanHB added new-task
Ahmad21Omar implement feedback on PR
e1add28d
Ahmad21Omar
NathanHB
NathanHB commented on 2025-09-24
Ahmad21Omar remove logging and raise exception when judge not loaded
85ed4897
NathanHB
NathanHB
NathanHB approved these changes on 2025-09-25
NathanHB NathanHB merged c7a063ae into main 199 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone