Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ClassActionPrediction: A Challenging Benchmark for Legal Judgment Prediction of Class Action Cases in the US (2211.00582v1)

Published 1 Nov 2022 in cs.CL, cs.AI, cs.LG, and cs.NE

Abstract: The research field of Legal NLP has been very active recently, with Legal Judgment Prediction (LJP) becoming one of the most extensively studied tasks. To date, most publicly released LJP datasets originate from countries with civil law. In this work, we release, for the first time, a challenging LJP dataset focused on class action cases in the US. It is the first dataset in the common law system that focuses on the harder and more realistic task involving the complaints as input instead of the often used facts summary written by the court. Additionally, we study the difficulty of the task by collecting expert human predictions, showing that even human experts can only reach 53% accuracy on this dataset. Our Longformer model clearly outperforms the human baseline (63%), despite only considering the first 2,048 tokens. Furthermore, we perform a detailed error analysis and find that the Longformer model is significantly better calibrated than the human experts. Finally, we publicly release the dataset and the code used for the experiments.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Gil Semo (3 papers)
  2. Dor Bernsohn (3 papers)
  3. Ben Hagag (3 papers)
  4. Gila Hayat (2 papers)
  5. Joel Niklaus (21 papers)
Citations (17)