lighteval
f3639c67
- Use `n=16` samples to estimate `pass@1` for AIME benchmarks (#661)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
327 days ago
Use `n=16` samples to estimate `pass@1` for AIME benchmarks (#661) * Use n=16 samples to estimate pass@1 for AIME benchmarks * Remove other metrics
References
#661 - Use `n=16` samples to estimate `pass@1` for AIME benchmarks
Author
lewtun
Parents
fcb784d5
Loading