Add pass@1 for GPQA-D and MATH-500 #698
Add pass@1 for GPQA-D and clean up AIME
6918cd93
lewtun
commented
on 2025-05-01
lewtun
commented
on 2025-05-01
Add pass@1 for math_500
978c1434
lewtun
changed the title Add pass@1 for GPQA-D and clean up AIME Add pass@1 for GPQA-D and MATH-500 245 days ago
Add pass@1 for MATH-500
6c5e1b0e
lewtun
commented
on 2025-05-01
Update test
7cd9c26b
Fix
7f295e66
NathanHB
approved these changes
on 2025-05-05
lewtun
merged
d50bc307
into main 241 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub