lighteval
MMLU Redux and Fixing the caching
#883
Merged

MMLU Redux and Fixing the caching #883

clefourrier merged 26 commits into main from test_mmlu_redux_2
clefourrier
clefourrier init
d9de61d9
HuggingFaceDocBuilderDev
clefourrier
clefourrier commented on 2025-07-25
clefourrier Update src/lighteval/tasks/default_prompts.py
a2b1bad2
clefourrier clefourrier marked this pull request as draft 218 days ago
clefourrier Merge branch 'main' into test_mmlu_redux_2
0e009378
clefourrier small fixes
8429f5fb
clefourrier clefourrier marked this pull request as ready for review 211 days ago
clefourrier clefourrier requested a review from lewtun lewtun 211 days ago
clefourrier Merge branch 'main' into test_mmlu_redux_2
1cfa2f3f
clefourrier Merge branch 'main' into test_mmlu_redux_2
f600c40a
clefourrier Merge branch 'main' into test_mmlu_redux_2
541c89fd
clefourrier clefourrier requested a review from NathanHB NathanHB 187 days ago
NathanHB
NathanHB commented on 2025-08-25
NathanHB Apply suggestion from @NathanHB
1e139ab9
lewtun
lewtun approved these changes on 2025-08-28
clefourrier
clefourrier
lewtun
clefourrier
clefourrier fix metrics kwargs passing
b5975462
clefourrier add default metric for mmlu_redux
b0e55846
clefourrier fix
951cbc0c
clefourrier update caching"
96df0e7f
clefourrier Merge branch 'main' into test_mmlu_redux_2
971e0821
clefourrier better str for classes, which allows correct hashing
8b28aba1
clefourrier last fix is to possibly push to configs
a3eeebd5
clefourrier removed token system + added an actual separation between tasks with …
c7d1eb05
clefourrier fix
42ec1ce5
clefourrier clefourrier requested a review from NathanHB NathanHB 172 days ago
clefourrier
clefourrier update caching tests
68688b43
clefourrier
clefourrier
clefourrier
clefourrier simplified system with cleaner task_id
b463efff
clefourrier adapted to new functions
27be046c
clefourrier update vllm test
f7c62bbb
clefourrier fixing the metric changed res by 1 point
0c4a429c
clefourrier clefourrier changed the title MMLU Redux MMLU Redux and Fixing the caching 170 days ago
clefourrier byteorder arg
dbac859f
clefourrier this makes little sense
668e2f44
NathanHB
NathanHB commented on 2025-09-11
NathanHB
NathanHB commented on 2025-09-11
NathanHB
NathanHB commented on 2025-09-11
NathanHB
NathanHB approved these changes on 2025-09-11
clefourrier Update src/lighteval/utils/cache_management.py
ef5ffe74
clefourrier comments
085f59cf
clefourrier clefourrier merged eda7eed3 into main 170 days ago
NathanHB NathanHB added bug
NathanHB NathanHB added new-task

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone