Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Constructing Hierarchical Q&A Datasets for Video Story Understanding (1904.00623v1)

Published 1 Apr 2019 in cs.AI, cs.CV, cs.LG, and cs.MM

Abstract: Video understanding is emerging as a new paradigm for studying human-like AI. Question-and-Answering (Q&A) is used as a general benchmark to measure the level of intelligence for video understanding. While several previous studies have suggested datasets for video Q&A tasks, they did not really incorporate story-level understanding, resulting in highly-biased and lack of variance in degree of question difficulty. In this paper, we propose a hierarchical method for building Q&A datasets, i.e. hierarchical difficulty levels. We introduce three criteria for video story understanding, i.e. memory capacity, logical complexity, and DIKW (Data-Information-Knowledge-Wisdom) pyramid. We discuss how three-dimensional map constructed from these criteria can be used as a metric for evaluating the levels of intelligence relating to video story understanding.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Yu-Jung Heo (14 papers)
  2. Kyoung-Woon On (19 papers)
  3. Seongho Choi (9 papers)
  4. Jaeseo Lim (5 papers)
  5. Jinah Kim (1 paper)
  6. Jeh-Kwang Ryu (6 papers)
  7. Byung-Chull Bae (2 papers)
  8. Byoung-Tak Zhang (83 papers)
Citations (4)