Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Response: Emergent analogical reasoning in large language models (2308.16118v2)

Published 30 Aug 2023 in cs.CL and cs.AI

Abstract: In their recent Nature Human Behaviour paper, "Emergent analogical reasoning in LLMs," (Webb, Holyoak, and Lu, 2023) the authors argue that "LLMs such as GPT-3 have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems." In this response, we provide counterexamples of the letter string analogies. In our tests, GPT-3 fails to solve simplest variations of the original tasks, whereas human performance remains consistently high across all modified versions. Zero-shot reasoning is an extraordinary claim that requires extraordinary evidence. We do not see that evidence in our experiments. To strengthen claims of humanlike reasoning such as zero-shot reasoning, it is important that the field develop approaches that rule out data memorization.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Damian Hodel (3 papers)
  2. Jevin West (14 papers)
Citations (10)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets