Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Can Machines Imitate Humans? Integrative Turing Tests for Vision and Language Demonstrate a Narrowing Gap (2211.13087v2)

Published 23 Nov 2022 in cs.CV and cs.AI

Abstract: As AI algorithms increasingly participate in daily activities, it becomes critical to ascertain whether the agents we interact with are human or not. To address this question, we turn to the Turing test and systematically benchmark current AIs in their abilities to imitate humans in three language tasks (Image captioning, Word association, and Conversation) and three vision tasks (Object detection, Color estimation, and Attention prediction). The experiments involved 549 human agents plus 26 AI agents for dataset creation, and 1,126 human judges plus 10 AI judges, in 25,650 Turing-like tests. The results reveal that current AIs are not far from being able to impersonate humans in complex language and vision challenges. While human judges were often deceived, simple AI judges outperformed human judges in distinguishing human answers from AI answers. The results of imitation tests are only minimally correlated with standard performance metrics in AI. Thus, evaluating whether a machine can pass as a human constitutes an important independent test to evaluate AI algorithms. The curated, large-scale, Turing datasets introduced here and their evaluation metrics provide new benchmarks and insights to assess whether an agent is human or not and emphasize the relevance of rigorous, systematic, and quantitative imitation tests in these and other AI domains.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (21)
  1. Mengmi Zhang (35 papers)
  2. Giorgia Dellaferrera (9 papers)
  3. Ankur Sikarwar (6 papers)
  4. Marcelo Armendariz (2 papers)
  5. Noga Mudrik (9 papers)
  6. Prachi Agrawal (3 papers)
  7. Spandan Madan (12 papers)
  8. Andrei Barbu (35 papers)
  9. Haochen Yang (5 papers)
  10. Tanishq Kumar (6 papers)
  11. Meghna Sadwani (1 paper)
  12. Stella Dellaferrera (1 paper)
  13. Michele Pizzochero (30 papers)
  14. Hanspeter Pfister (131 papers)
  15. Gabriel Kreiman (45 papers)
  16. Caishun Chen (7 papers)
  17. Mranmay Shetty (1 paper)
  18. Shui'Er Han (1 paper)
  19. Aman Raj Singh (1 paper)
  20. Brandon Tang (1 paper)
Citations (1)

Summary

We haven't generated a summary for this paper yet.