huggingface/lighteval


Merge branch 'main' into nathan-bump-lighteval-version

clefourrier committed 2 years ago

Verified 3f2e90a1

Change the eos condition for GSM8K (#85)

clefourrier committed 2 years ago

Verified 9b3813ff

Fix parallel data processing bug (#92)

clefourrier committed 2 years ago

Verified 3b0aa23d

add license header to src files (#89)

NathanHB committed 2 years ago

Verified 2d529ac8

Sets a max length for the MATH task (#83)

clefourrier committed 2 years ago

Verified 458d50b0

bump git python (#90)

NathanHB committed 2 years ago

Verified 5ba92603

update pyproject ruff config to match new version

Nathan Habib committed 2 years ago

410fd908

Tidy up dependency groups (#81)

lewtun committed 2 years ago

Verified 7bf40877

Create LICENSE (#86)

clefourrier committed 2 years ago

Verified 927e63ef

Merge branch 'main' into nathan-bump-lighteval-version

NathanHB committed 2 years ago

Verified d22538ed

Upgrade huggingface_hub to fix datasets import and add trust_remote_code in datasets (#84)

clefourrier committed 2 years ago

Verified 9ecab065

Release: v0.2.0

Nathan Habib committed 2 years ago

bc7a3dbe

Relax sentencepiece version (#74)

lewtun committed 2 years ago

Verified b9d02770

Update ruff (#71)

clefourrier committed 2 years ago

Verified 030945b1

Now manages no generation size is set in a generative task description (#76)

clefourrier committed 2 years ago

Verified e49585da

Fixes wikitext prompts + some patches on tg models (#64)

clefourrier committed 2 years ago

Verified cabef7c4

Adding custom metric system + IFEval as an example (#48)

clefourrier committed 2 years ago

Verified acffc1a8

Just adding the custom metrics system (#65)

clefourrier committed 2 years ago

Verified 3785d852

Fixes chat template application to choices (#67)

clefourrier committed 2 years ago

Verified 49074998

Remove the eos token override in the Default Config Task (#54)

clefourrier committed 2 years ago

Verified 449817f6

thomwolf committed 2 years ago

Verified 589e6b0d

Update leaderboard task set (#60)

lewtun committed 2 years ago

Verified 6a3e3b92

Tweak installation / usage sections of README (#55)

lewtun committed 2 years ago

Verified 480d85ef

Adding support for Arabic benchmarks : AceGPT benchmarking suite (#44)

alielfilali01 committed 2 years ago

Verified 090101f1

New mechanism for evaluation contributions (#47)

clefourrier committed 2 years ago

Verified 92e9b505

clefourrier committed 2 years ago

Verified 831ad47b

Improve the current chat template system (#38)

clefourrier committed 2 years ago

Verified 81fc8fda

bump transformers to 4.38 (#46)

NathanHB committed 2 years ago

Verified fb57ffc6

Add an automatic system to compute average for tasks with subtasks

clefourrier committed 2 years ago

Verified 62abc78c

Update README.md (#37)

clefourrier committed 2 years ago

Verified 77c20164
