Fixing naming for sample evals + adding reqs in aime24 #989
homogenize k and n in parametrizable metrics
96020089
updated aime, last metric fixes
4db46392
fix
393c4305
restore rm import
ce3e9435
restore
3d260734
update doc
3082a030
gpqa fix
038e64c0
pass at
238729dd
recall
00d6309c
test
2e435557
clefourrier force pushed from 8a5821c2 to 2e435557 86 days ago
Merge branch 'main' into mini-fixes
20fafe4b
Merge branch 'main' into mini-fixes
8282d84b
NathanHB approved these changes on 2025-10-14
Assignees: No one assigned
Labels: task-update, science-team