Talking About Large Language Models (2212.03551v5)

Published 7 Dec 2022 in cs.CL and cs.LG

Abstract: Thanks to rapid progress in artificial intelligence, we have entered an era when technology and philosophy intersect in interesting ways. Sitting squarely at the centre of this intersection are LLMs. The more adept LLMs become at mimicking human language, the more vulnerable we become to anthropomorphism, to seeing the systems in which they are embedded as more human-like than they really are. This trend is amplified by the natural tendency to use philosophically loaded terms, such as "knows", "believes", and "thinks", when describing these systems. To mitigate this trend, this paper advocates the practice of repeatedly stepping back to remind ourselves of how LLMs, and the systems of which they form a part, actually work. The hope is that increased scientific precision will encourage more philosophical nuance in the discourse around artificial intelligence, both within the field and in the public sphere.

Citations (183)

Summary

  • The paper analyzes how scaling LLMs improves performance while cautioning against the anthropomorphism this invites.
  • It argues that philosophically loaded terms like "knows" or "believes" mislead when applied to LLMs, and advocates more scientifically precise language.
  • It considers future AI designs that ground LLMs in external reality, which could improve trust and the accuracy of claims about system capabilities.

Examination of "Talking About LLMs"

Murray Shanahan's paper explores the intersection of technology and philosophy, centered on the capabilities of LLMs. It presents a critical analysis of our anthropomorphic tendencies when engaging with LLMs and argues for a more scientifically precise discourse, aiming to sharpen philosophical understanding in the field of AI.

Core Insights

The paper underscores several key observations about LLMs:

  1. Scaling and Performance: LLMs such as GPT-3 demonstrate improved performance as training data and model size increase, and scaling also produces qualitative leaps in capability, bringing human-like language mimicry into sharper focus.
  2. Mimicking Human Language: Shanahan discusses the anthropomorphic trap we fall into when LLMs deliver human-like responses. While humans intuitively ascribe human traits to LLMs, the underlying operation of these models remains fundamentally mechanical: predicting statistically likely continuations of token sequences (a minimal illustration follows this list).
  3. Intentional Stance: The paper evaluates the use of familiar psychological terms such as "knows" or "believes" when describing LLMs. Such language, while convenient, can mislead and encourage overly human-like perceptions of AI functionality.
  4. External Reality and Truth: Shanahan argues that LLMs lack a mechanism to truly "know" or "believe," as they cannot engage with external reality or distinguish truth in human terms. The absence of such mechanisms calls for caution in ascribing understanding or belief to LLMs.
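
To make the point in item 2 concrete, here is a minimal sketch, assuming the Hugging Face transformers library with GPT-2 as a stand-in model (the paper prescribes neither). It inspects the model's distribution over next tokens for a prompt in the spirit of the paper's Moon-landing example:

```python
# Minimal sketch: an LLM's output is a probability distribution over
# next tokens. GPT-2 via Hugging Face transformers is an illustrative
# choice, not anything prescribed by the paper.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = "The first person to walk on the Moon was"
inputs = tok(prompt, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

# The model's entire "answer" is this distribution over which token is
# statistically likely to come next, given its training corpus.
next_token_probs = torch.softmax(logits[0, -1], dim=-1)
top = torch.topk(next_token_probs, k=5)
for p, i in zip(top.values, top.indices):
    print(f"{tok.decode([int(i)])!r}  p={float(p):.3f}")
```

Whichever continuation is ultimately emitted, it is sampled from this distribution of statistically likely next tokens; nothing in the computation consults the world or checks a fact.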

Implications and Future Directions

The implications of Shanahan’s arguments are both theoretical and practical:

  • Theoretical Refinement: The paper pushes for refinement in how we philosophically and linguistically frame AI systems. It suggests developing language that accurately reflects the capabilities and limitations of LLMs without falling into the trap of anthropomorphism.
  • Policy and Communication: For policymakers and the broader public, avoiding misleading representations of AI capabilities becomes essential to crafting reasonable expectations and regulations.
  • System Design and Trust: Embedding LLMs within larger systems that consult factual external resources could move us closer to agents that display a form of "belief" (sketched after this list). However, this hinges on careful system design and robust mechanisms for interacting with reality.
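
As a rough sketch of such a system (not Shanahan's design), the following wraps a model call in a retrieve-then-generate loop; the in-memory KNOWLEDGE_BASE and the llm_generate stub are illustrative assumptions, not a real API:

```python
# Sketch of embedding an LLM in a larger system that consults an external
# factual resource before answering. The toy KNOWLEDGE_BASE and the
# llm_generate stub are assumptions for illustration only.

KNOWLEDGE_BASE = {
    "moon": "Neil Armstrong first walked on the Moon on 20 July 1969.",
    "everest": "Mount Everest's summit stands 8,849 m above sea level.",
}

def retrieve(question: str) -> str:
    """Toy retriever: return facts whose key appears in the question."""
    q = question.lower()
    hits = [fact for key, fact in KNOWLEDGE_BASE.items() if key in q]
    return "\n".join(hits) if hits else "No relevant sources found."

def llm_generate(prompt: str) -> str:
    """Stand-in for a call to any text-completion model."""
    raise NotImplementedError("Plug in a real model call here.")

def grounded_answer(question: str) -> str:
    # Gather evidence from the external resource first, then instruct the
    # model to answer only from that evidence, so its output can be
    # checked against sources rather than taken on trust.
    prompt = (
        "Answer using only the sources below; if they are insufficient, "
        "say so.\n\n"
        f"Sources:\n{retrieve(question)}\n\nQuestion: {question}"
    )
    return llm_generate(prompt)
```

Grounding of this kind does not by itself confer belief; it only makes the system's outputs checkable against an external source, which is the sort of connection to external reality the paper says bare LLMs lack.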

Future Developments in AI

Anticipated future developments might include:

  • Advanced Embodiment: Future systems may see LLMs integrated into embodied agents capable of more interactive and meaningful engagements with their environment.
  • Enhanced Trust Mechanisms: Developing methods that verify AI systems faithfully execute logic-based tasks could strengthen trust in AI applications, narrowing the gap between artificial reasoning and human-like understanding.
  • Evolving Language Frameworks: With the continued assimilation of AI into human contexts, language describing AI capabilities may evolve, possibly introducing bespoke terminology that suits AI’s unique mechanics.

In "Talking About LLMs," Shanahan offers a precise examination of LLMs, recommending caution against anthropomorphism and urging clarity in AI discourse. This approach lays a foundation for more nuanced interactions with AI, fostering better understanding and trust between humans and machines.
