use inspect-ai to evaluate aime25 and gsm8k
2696a49c
revert file
578d5308
working for 3 tasks
21fa870d
parallel evals of tasks
27b2af11
adds gpqa diamond to inspect
b9a610dc
move tasks to individual files
25c11285
move tasks to individual files
0d42edf4
enable extended tasks as well
6cc3c041
run precomit hook
4c38951d
fix mkqa
d2fd5e1e
chaange extended suite to lighteval
2ddb0f94
chaange extended suite to lighteval
ee971228
add metdata to tasks
e2c8e226
add metdata to tasks
c980ddbe
remove license notice and put docstring on top of file
57fe3908
homogenize tags
ee081f20
add docstring for all multilingual tasks
1ed1602a
add docstring for all multilingual tasks
f4b0e274
add name and dataset to metadata
81d9e4ed
use TASKS_TABLE for multilingual tasks
b7345327
use TASKS_TABLE for default tasks
c3911fcf
use TASKS_TABLE for default tasks
e439f706
loads all tasks correclty
6447ee75
move community tasks to default tasks and update doc
88754bfa
move community tasks to default tasks and update doc
5445f5c0
Merge remote-tracking branch 'origin/main' into nathan-reorg-tasks
f53bd76f
revert uneeded changes
6a0c615d
fix doc build
1435e382
fix doc build
15f41f26
remove custom tasks and let user decide if loading multilingual tasks
74e5c0f4
load-tasks multilingual fix
aad136c1
update doc
242bc438
remove uneeded file
6806bf88
update readme
e94fa590
update readme
8800d1ac
update readme
970f33bf
fix test
b8c26dc2
add back the custom tasks
764de725
add back the custom tasks
a326ea86
fix tasks
81081cde
fix tasks
74b40f62
fix tasks
083fb1b5
fix tests
2dab2bfd
fix tests
57ca0e53
add inspect-ai
480e40af
NathanHB
changed the base branch from
main
to
nathan-reorg-tasks
176 days ago
NathanHB
marked this pull request as draft 176 days ago
add tasks
ade29007
add gpqa
079ceaf1
make model config work
8d007997
NathanHB
marked this pull request as ready for review 168 days ago
Update src/lighteval/metrics/metrics.py
cea5e997
init
fb47bb78
Merge branch 'nathan-move-to-inspectai' of github.com:huggingface/lig…
2736bc9c
NathanHB
changed the base branch from
nathan-reorg-tasks
to
main
167 days ago
Merge branch 'main' into nathan-move-to-inspectai
d5e6c9fd
fix tests
e55a9af9
Merge branch 'nathan-move-to-inspectai' of github.com:huggingface/lig…
ba41f1c3
fix tests
59c5dcc4
fix tests
40254db0
fix tests
53275fe9
add correct system prompt for hle
72e5c2b5
add correct system prompt for hle
7fc1753d
review suggestions
260d7443
add doc
835b7990
change buttons
c216a272
change buttons
21e6020f
change buttons
7e654008
move benchmark finder to openeval org
0a4f6bef
better help for eval
b661d0d7
better help for eval
f142b391
NathanHB
merged
880bebef
into main 162 days ago
Assignees
No one assigned
Labels
feature
enhancement
Login to write a write a comment.
Login via GitHub