lighteval
Use `n=16` samples to estimate `pass@1` for AIME benchmarks
#661
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
2
Changes
View On
GitHub
Use `n=16` samples to estimate `pass@1` for AIME benchmarks
#661
lewtun
merged 2 commits into
main
from
aime-pass@k
Use n=16 samples to estimate pass@1 for AIME benchmarks
7c7c28a0
Remove other metrics
35058768
lewtun
requested a review
from
NathanHB
332 days ago
lewtun
requested a review
from
clefourrier
332 days ago
clefourrier
approved these changes on 2025-04-07
lewtun
merged
f3639c67
into main
331 days ago
NathanHB
added
task-update
Login to write a write a comment.
Login via GitHub
Reviewers
clefourrier
NathanHB
Assignees
No one assigned
Labels
task-update
Milestone
No milestone
Login to write a write a comment.
Login via GitHub