lighteval
d50bc307
- Add pass@1 for GPQA-D and MATH-500 (#698)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
251 days ago
Add pass@1 for GPQA-D and MATH-500 (#698) * Add pass@1 for GPQA-D and clean up AIME * Add pass@1 for math_500 * Add pass@1 for MATH-500 * Update test * Fix
References
#698 - Add pass@1 for GPQA-D and MATH-500
Author
lewtun
Parents
96e885d4
Loading