X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events (2308.10441v1)

Published 21 Aug 2023 in cs.AI and cs.CV

Abstract: Intuitive physics is pivotal for human understanding of the physical world, enabling prediction and interpretation of events even in infancy. Nonetheless, replicating this level of intuitive physics in AI remains a formidable challenge. This study introduces X-VoE, a comprehensive benchmark dataset, to assess AI agents' grasp of intuitive physics. Built on the Violation of Expectation (VoE) paradigm from developmental psychology, X-VoE establishes a higher bar for the explanatory capacities of intuitive physics models. Each VoE scenario within X-VoE encompasses three distinct settings, probing models' comprehension of events and their underlying explanations. Beyond model evaluation, we present an explanation-based learning system that captures physics dynamics and infers occluded object states solely from visual sequences, without explicit occlusion labels. Experimental outcomes highlight our model's alignment with human commonsense when tested against X-VoE. A remarkable feature is our model's ability to visually explain VoE events by reconstructing concealed scenes. We conclude by discussing the implications of these findings and outlining future research directions. Through X-VoE, we catalyze the advancement of AI endowed with human-like intuitive physics capabilities.

Authors (7)
  1. Bo Dai (245 papers)
  2. Linge Wang (1 paper)
  3. Baoxiong Jia (35 papers)
  4. Zeyu Zhang (143 papers)
  5. Song-Chun Zhu (216 papers)
  6. Chi Zhang (567 papers)
  7. Yixin Zhu (102 papers)
Citations (1)