PR #1032 Legal NLP tasks on Swiss data

JoelNiklaus merged 16 commits into huggingface:main from rolshoven:community_task_slds

rolshoven force pushed from 481bc4f5 to f3a626dc 208 days ago

Legal NLP tasks on Swiss data

83c8a079

refactor: split Swiss legal multilingual tasks into modular package

8d19996d

refactor: update higher_is_better type in MetricGrouping

4032ce45

refactor: Updated prompts and implementation to match the latest SwiL…

ca7e8342

refactor: Enhance COMET and GEMBA metric loading with error handling

606417b8

Add Gemba dependency for Swiss legal evaluations and remove `suite` p…

8c449c2c

Fix batched metric aggregation for grouped metric names

9113d537

Fixed missing system prompt

6d3fdf3b

Judge models now are used through OpenRouter

bba0e914

fix reasoning model token handling when max_tokens is unset

cc22ecb6

rolshoven force pushed from ac784b97 to cc22ecb6 71 days ago

chore: trigger PR update

a4e0ba13

Merge branch 'main' into community_task_slds

406953fd

NathanHB commented on 2026-05-20

NathanHB approved these changes on 2026-05-20

fix: return raw score for BLEU, CHRF, and TER metrics instead of scal…

052586aa

fix: replaced accidental default value assignment with intended type …

fa341ac9

fix: add error handling for unsupported languages in Swiss Landmark D…

011c4047

fix: avoid huge negative BERTScore from baseline rescaling

df00dc16

JoelNiklaus merged 8d29839e into main 7 days ago

Reviewers

NathanHB

lighteval Legal NLP tasks on Swiss data #1032 Merged
