huggingface/lighteval

Pull Requests Commits

NathanHB committed 258 days ago

57ca0e53

NathanHB committed 258 days ago

2dab2bfd

NathanHB committed 258 days ago

083fb1b5

NathanHB committed 258 days ago

74b40f62

NathanHB committed 258 days ago

81081cde

add back the custom tasks

NathanHB committed 258 days ago

a326ea86

add back the custom tasks

NathanHB committed 258 days ago

764de725

NathanHB committed 259 days ago

b8c26dc2

NathanHB committed 259 days ago

970f33bf

NathanHB committed 259 days ago

8800d1ac

NathanHB committed 259 days ago

e94fa590

remove uneeded file

NathanHB committed 259 days ago

6806bf88

NathanHB committed 259 days ago

242bc438

load-tasks multilingual fix

NathanHB committed 259 days ago

aad136c1

remove custom tasks and let user decide if loading multilingual tasks

NathanHB committed 259 days ago

74e5c0f4

NathanHB committed 259 days ago

15f41f26

NathanHB committed 259 days ago

1435e382

revert uneeded changes

NathanHB committed 259 days ago

6a0c615d

Merge remote-tracking branch 'origin/main' into nathan-reorg-tasks

NathanHB committed 259 days ago

f53bd76f

move community tasks to default tasks and update doc

NathanHB committed 259 days ago

5445f5c0

move community tasks to default tasks and update doc

NathanHB committed 259 days ago

88754bfa

loads all tasks correclty

NathanHB committed 260 days ago

6447ee75

use TASKS_TABLE for default tasks

NathanHB committed 260 days ago

e439f706

use TASKS_TABLE for default tasks

NathanHB committed 260 days ago

c3911fcf

use TASKS_TABLE for multilingual tasks

NathanHB committed 260 days ago

b7345327

add name and dataset to metadata

NathanHB committed 260 days ago

81d9e4ed

Fixing naming for sample evals + adding reqs in aime24 (#989)

clefourrier committed 261 days ago

Verified 161d47cc

added fallback for incomplete configs for vlm models launched as llms (#828)

clefourrier committed 261 days ago

Verified e7d885c3

Fix 999: always provide parameters in the metric name to allow using several combinations (#1017)

clefourrier committed 261 days ago

Verified 70acb852

Fix nltk import failing (#1013)

clefourrier committed 261 days ago

Verified 3af89255

Older