
One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial Support (2306.09237v3)

Published 15 Jun 2023 in cs.CL, cs.AI, and cs.LG

Abstract: Recent strides in LLMs have saturated many NLP benchmarks, emphasizing the need for more challenging ones to properly assess LLM capabilities. However, domain-specific and multilingual benchmarks are rare because they require in-depth expertise to develop. Still, most public models are trained predominantly on English corpora, while other languages remain understudied, particularly for practical domain-specific NLP tasks. In this work, we introduce a novel NLP benchmark for the legal domain that challenges LLMs in five key dimensions: processing long documents (up to 50K tokens), using domain-specific knowledge (embodied in legal texts), multilingual understanding (covering five languages), multitasking (comprising legal document-to-document Information Retrieval, Court View Generation, Leading Decision Summarization, Citation Extraction, and eight challenging Text Classification tasks), and reasoning (comprising especially Court View Generation, but also the Text Classification tasks). Our benchmark contains diverse datasets from the Swiss legal system, allowing for a comprehensive study of the underlying non-English, inherently multilingual legal system. Despite the large size of our datasets (some with hundreds of thousands of examples), existing publicly available multilingual models struggle with most tasks, even after extensive in-domain pre-training and fine-tuning. We publish all resources (benchmark suite, pre-trained models, code) under permissive open CC BY-SA licenses.

Authors (8)
  1. Vishvaksenan Rasiah (2 papers)
  2. Ronja Stern (3 papers)
  3. Veton Matoshi (3 papers)
  4. Matthias Stürmer (13 papers)
  5. Ilias Chalkidis (40 papers)
  6. Daniel E. Ho (45 papers)
  7. Joel Niklaus (21 papers)
  8. Srinanda Brügger Bose (2 papers)
Citations (8)