Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset (2010.11997v1)

Published 8 Oct 2020 in cs.HC, cs.CL, cs.CV, and cs.SI

Abstract: Modern social intelligence includes the ability to watch videos and answer questions about social and theory-of-mind-related content, e.g., for a scene in Harry Potter, "Is the father really upset about the boys flying the car?" Social visual question answering (social VQA) is emerging as a valuable methodology for studying social reasoning in both humans (e.g., children with autism) and AI agents. However, this problem space spans enormous variations in both videos and questions. We discuss methods for creating and characterizing social VQA datasets, including 1) crowdsourcing versus in-house authoring, including sample comparisons of two new datasets that we created (TinySocial-Crowd and TinySocial-InHouse) and the previously existing Social-IQ dataset; 2) a new rubric for characterizing the difficulty and content of a given video; and 3) a new rubric for characterizing question types. We close by describing how having well-characterized social VQA datasets will enhance the explainability of AI agents and can also inform assessments and educational interventions for people.

Authors (10)
  1. Zhanwen Chen (2 papers)
  2. Shiyao Li (17 papers)
  3. Roxanne Rashedi (1 paper)
  4. Xiaoman Zi (1 paper)
  5. Morgan Elrod-Erickson (1 paper)
  6. Bryan Hollis (1 paper)
  7. Angela Maliakal (1 paper)
  8. Xinyu Shen (8 papers)
  9. Simeng Zhao (1 paper)
  10. Maithilee Kunda (15 papers)
Citations (1)
