Can Machines Imitate Humans? Integrative Turing Tests for Vision and Language Demonstrate a Narrowing Gap (2211.13087v2)

Published 23 Nov 2022 in cs.CV and cs.AI

Abstract: As AI algorithms increasingly participate in daily activities, it becomes critical to ascertain whether the agents we interact with are human or not. To address this question, we turn to the Turing test and systematically benchmark current AIs in their abilities to imitate humans in three language tasks (Image captioning, Word association, and Conversation) and three vision tasks (Object detection, Color estimation, and Attention prediction). The experiments involved 549 human agents plus 26 AI agents for dataset creation, and 1,126 human judges plus 10 AI judges, in 25,650 Turing-like tests. The results reveal that current AIs are not far from being able to impersonate humans in complex language and vision challenges. While human judges were often deceived, simple AI judges outperformed human judges in distinguishing human answers from AI answers. The results of imitation tests are only minimally correlated with standard performance metrics in AI. Thus, evaluating whether a machine can pass as a human constitutes an important independent test to evaluate AI algorithms. The curated, large-scale, Turing datasets introduced here and their evaluation metrics provide new benchmarks and insights to assess whether an agent is human or not and emphasize the relevance of rigorous, systematic, and quantitative imitation tests in these and other AI domains.

Authors (21)

Mengmi Zhang
Giorgia Dellaferrera
Ankur Sikarwar
Marcelo Armendariz
Noga Mudrik
Prachi Agrawal
Spandan Madan
Andrei Barbu
Haochen Yang
Tanishq Kumar
Meghna Sadwani
Stella Dellaferrera
Michele Pizzochero
Hanspeter Pfister
Gabriel Kreiman
Caishun Chen
Mranmay Shetty
Shui'Er Han
Aman Raj Singh
Brandon Tang
