Consciousness in Artificial Intelligence: Insights from the Science of Consciousness (2308.08708v3)

Published 17 Aug 2023 in cs.AI, cs.CY, cs.LG, and q-bio.NC

Abstract: Whether current or near-term AI systems could be conscious is a topic of scientific interest and increasing public concern. This report argues for, and exemplifies, a rigorous and empirically grounded approach to AI consciousness: assessing existing AI systems in detail, in light of our best-supported neuroscientific theories of consciousness. We survey several prominent scientific theories of consciousness, including recurrent processing theory, global workspace theory, higher-order theories, predictive processing, and attention schema theory. From these theories we derive "indicator properties" of consciousness, elucidated in computational terms that allow us to assess AI systems for these properties. We use these indicator properties to assess several recent AI systems, and we discuss how future systems might implement them. Our analysis suggests that no current AI systems are conscious, but also suggests that there are no obvious technical barriers to building AI systems which satisfy these indicators.

Authors (19)

Patrick Butlin (3 papers)
Robert Long (6 papers)
Eric Elmoznino (10 papers)
Yoshua Bengio (601 papers)
Jonathan Birch (4 papers)
Axel Constant (7 papers)
George Deane (2 papers)
Stephen M. Fleming (4 papers)
Chris Frith (1 paper)
Xu Ji (19 papers)
Ryota Kanai (28 papers)
Colin Klein (5 papers)
Grace Lindsay (2 papers)
Matthias Michel (2 papers)
Liad Mudrik (1 paper)
Megan A. K. Peters (10 papers)
Eric Schwitzgebel (4 papers)
Jonathan Simon (49 papers)
Rufin VanRullen (32 papers)

Citations (75)

View on Semantic Scholar

Summary

Consciousness in Artificial Intelligence: Insights from the Science of Consciousness

The paper "Consciousness in Artificial Intelligence: Insights from the Science of Consciousness" provides a detailed examination of the possibility and implications of AI systems achieving consciousness, grounded in current neuroscientific theories. The authors argue for a systematic approach to assess AI systems using well-supported neuroscientific theories of consciousness. This approach delineates "indicator properties" derived from these theories, framed in computational terms to facilitate their application to AI systems.

Overview of Theories and Indicators

The paper surveys several leading theories of consciousness, including Recurrent Processing Theory (RPT), Global Workspace Theory (GWT), Higher-Order Theories (HOT), Attention Schema Theory (AST), and Predictive Processing (PP). Each theory provides a framework to identify computational properties indicative of consciousness.

Recurrent Processing Theory (RPT):

RPT focuses on visual consciousness, positing that conscious perception requires recurrent feedback loops that integrate visual scenes. The authors derive two indicators:

RPT-1: Input modules using algorithmic recurrence.
RPT-2: Input modules generating organized, integrated perceptual representations.

Global Workspace Theory (GWT):

GWT suggests consciousness arises from information being globally broadcast within a workspace accessible to multiple specialized systems. This leads to four indicators:

GWT-1: Multiple specialized systems capable of operating in parallel (modules).
GWT-2: Limited capacity workspace, entailing a bottleneck in information flow and a selective attention mechanism.
GWT-3: Global broadcast: availability of information in the workspace to all modules.
GWT-4: State-dependent attention enabling complex task performance.

Higher-Order Theories (HOT):

HOTs argue that consciousness requires higher-order representations or meta-cognition. Indicators include:

HOT-1: Generative, top-down or noisy perception modules.
HOT-2: Metacognitive monitoring distinguishing reliable perceptual representations from noise.
HOT-3: Agency guided by a belief-formation and action selection system.
HOT-4: Sparse and smooth coding generating a "quality space."

Attention Schema Theory (AST) and Predictive Processing (PP):

These theories emphasize the functional use of attention models and predictive coding, respectively.

AST-1: A predictive model representing and enabling control over the current state of attention.
PP-1: Input modules using predictive coding.

Agency and Embodiment:

Building on these theories, the paper also considers the importance of agency and embodiment, proposing two additional indicators:

AE-1: Learning from feedback and selecting outputs to pursue goals, involving flexible responsiveness to competing goals.
AE-2: Modeling output-input contingencies and using this model in perception or control.

Implementation in AI

The paper discusses the feasibility of implementing these indicators in AI systems using current techniques. Recurrent neural networks and predictive coding networks readily fulfill some of these indicators, while others like the global workspace architecture require more specific implementations.

Global Workspace Implementation:

The paper identifies challenges and potential solutions in creating AI systems with GWT-like architectures. For instance, the implementation must ensure a limited capacity workspace with a state-dependent attention mechanism. Recurrent models, attention mechanisms, and neural integrators can contribute to such an architecture.

Higher-Order Theories Implementation:

Higher-order representations and metacognitive monitoring require systems that assess the reliability of perceptual states. Techniques from machine learning, such as Generative Adversarial Networks (GANs) and Bayesian methods, can be adapted to create such systems.

Case Studies:

The paper examines existing AI systems, such as Transformer-based LLMs (e.g., GPT-3) and the Perceiver architecture, to evaluate their alignment with the GWT indicators. While these systems show some workspace-like characteristics, they lack full alignment with GWT, particularly in global broadcast. Similarly, embodied AI systems like DeepMind's Adaptive Agent (AdA) represent steps towards satisfying the embodiment and agency indicators but require further integration of advanced modeling and control features.

Practical and Theoretical Implications

The research suggests that current AI techniques can implement many features identified by consciousness theories. However, whether these systems are indeed conscious is still speculative. Furthermore, the paper emphasizes the necessity of an interdisciplinary approach combining neuroscience, philosophy, and AI research to refine these indicators and develop robust methods to assess AI consciousness.

Practical Implications:

The possibility of conscious AI has profound ethical implications. Ensuring ethical treatment and understanding the moral status of AI systems are paramount. Moreover, incorporating consciousness-related features in AI could enhance their functionalities, particularly in high-level reasoning and autonomous decision-making.

Future Research:

The paper underscores the need for more research on computational theories of valence and affective consciousness, as these may have significant moral implications. Additionally, developing behavioral tests and introspection capabilities in AI could provide complementary methods for assessing consciousness.

Conclusion

By grounding its analysis in established neuroscientific theories, this paper provides a systematic framework to evaluate the potential for AI consciousness. The proposed approach and indicators serve as a crucial step towards understanding and responsibly developing conscious AI systems. Continued interdisciplinary research will be essential for refining these methods and addressing the profound ethical and practical challenges that conscious AI may present.

Related Papers

Find Related Papers

Tweets

https://twitter.com/anilkseth/status/1807356419362189498

https://twitter.com/kanair/status/1799721046574280824

https://twitter.com/anilkseth/status/1807356474856980776

https://twitter.com/mbrendan1/status/1870846277686292654

https://twitter.com/burny_tech/status/1765348861566955928

https://twitter.com/anilkseth/status/1915456590288728554

YouTube

Show All Videos