Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CRAB: Assessing the Strength of Causal Relationships Between Real-world Events (2311.04284v1)

Published 7 Nov 2023 in cs.CL and cs.AI

Abstract: Understanding narratives requires reasoning about the cause-and-effect relationships between events mentioned in the text. While existing foundation models yield impressive results in many NLP tasks requiring reasoning, it is unclear whether they understand the complexity of the underlying network of causal relationships of events in narratives. In this work, we present CRAB, a new Causal Reasoning Assessment Benchmark designed to evaluate causal understanding of events in real-world narratives. CRAB contains fine-grained, contextual causality annotations for ~2.7K pairs of real-world events that describe various newsworthy event timelines (e.g., the acquisition of Twitter by Elon Musk). Using CRAB, we measure the performance of several LLMs, demonstrating that most systems achieve poor performance on the task. Motivated by classical causal principles, we also analyze the causal structures of groups of events in CRAB, and find that models perform worse on causal reasoning when events are derived from complex causal structures compared to simple linear causal chains. We make our dataset and code available to the research community.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Angelika Romanou (11 papers)
  2. Syrielle Montariol (22 papers)
  3. Debjit Paul (18 papers)
  4. Leo Laugier (5 papers)
  5. Karl Aberer (44 papers)
  6. Antoine Bosselut (85 papers)
Citations (10)

Summary

We haven't generated a summary for this paper yet.