- The paper provides a comprehensive review of symbolic methods, active symbol architectures, deep learning, and probabilistic program induction in analogy-making.
- It evaluates how representation building is integrated with mapping processes, highlighting both the strengths and limitations of systems such as Copycat and SME.
- It concludes with proposals for future AI research, emphasizing idealized domains, core knowledge frameworks, and robust performance metrics.
Abstraction and Analogy-Making in Artificial Intelligence
Melanie Mitchell's paper provides a comprehensive review and analysis of the ongoing challenges and approaches in abstraction and analogy-making in AI. It situates the research in a historical frame, from the earliest AI proposals of McCarthy and colleagues to today's advanced learning systems, and underscores the critical gap between current AI capabilities and the human faculty for forming abstractions and analogies, setting the stage for an examination of several methodological paradigms.
The paper dissects several prominent approaches in the AI landscape:
- Symbolic Methods: This category explores the utility of structured symbolic representations. Its centerpiece is the Structure-Mapping Engine (SME), which implements Gentner's theory that analogy-making rests on aligning syntactic structures. Although SME can construct mappings between logically described domains, its reliance on syntax rather than semantics is a major limitation: the mappings it finds depend on how the hand-built representations happen to be written. The separation of representation-building from mapping and the semi-exhaustive nature of its search process further limit its applicability to more complex analogies.
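The syntax-over-semantics point can be made concrete with a toy sketch. The code below is not the actual SME algorithm; it is a minimal illustration, using the classic (hypothetical here) solar-system/atom example, of mapping driven purely by matching predicate names, with no notion of what the predicates mean.

```python
# Toy illustration of SME-style syntactic structure mapping (NOT the real SME
# algorithm): relations are paired only when their predicate names and arities
# match, regardless of meaning. Injectivity of the mapping is not enforced,
# a simplification the real SME does not make.

def map_structures(base, target):
    """Pair base/target facts with identical predicate names, and collect the
    object correspondences those pairings imply."""
    correspondences = {}
    pairs = []
    for b in base:
        for t in target:
            if b[0] == t[0] and len(b) == len(t):  # same predicate, same arity
                # accept the pairing only if consistent with earlier pairings
                if all(correspondences.get(x, y) == y for x, y in zip(b[1:], t[1:])):
                    for x, y in zip(b[1:], t[1:]):
                        correspondences[x] = y
                    pairs.append((b, t))
    return pairs, correspondences

# Hypothetical hand-built representations: note the analogy only emerges
# because both domains were written with the same predicate names.
solar = [("attracts", "sun", "planet"), ("revolves", "planet", "sun")]
atom  = [("attracts", "nucleus", "electron"), ("revolves", "electron", "nucleus")]

pairs, mapping = map_structures(solar, atom)
print(mapping)  # {'sun': 'nucleus', 'planet': 'electron'}
```

Renaming "attracts" to "pulls" in only one domain would destroy the mapping, which is exactly the brittleness the paper attributes to purely syntactic approaches.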
- Active Symbol Architectures: Highlighted by Hofstadter and Mitchell's Copycat program, this method intertwines representation building and mapping processes. It fosters emergent dynamical systems that integrate symbolic and subsymbolic elements. Copycat's effort to mimic high-level perceptual and analogy-making processes elucidates the importance of interleaving representation-building with mapping. However, limitations remain, notably the manual creation of its concept network and lack of self-directed learning capabilities.
- Deep Learning Approaches: The paper then turns to deep neural networks, which learn from raw data without pre-specified symbolic structures. It examines in detail the application of deep learning to Raven’s Progressive Matrices (RPMs), showcasing both capabilities and pitfalls. While networks like GSM and WReN achieved competitive accuracy, they often exploit statistical shortcuts in the benchmarks, revealing a lack of true abstraction skills. Their need for extensive training data further distances these models from human cognitive processes in abstraction and analogy.
- Probabilistic Program Induction: This methodology treats abstraction and concept learning as the task of inducing programs. Program induction offers flexible abstraction, reusability, modularity, and interpretability, but at the cost of requiring human-structured knowledge and computationally expensive search. Both the Omniglot task and Bongard problems showcase the potential and the challenges of framing cognitive tasks probabilistically.
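A minimal sketch can show what "concepts as programs" means in practice. The snippet below (an illustration, not Lake et al.'s Omniglot model) enumerates compositions of hand-supplied DSL primitives and returns the shortest program consistent with a few input/output examples; the hand-supplied primitives and the combinatorial search are exactly the two costs noted above.

```python
# Program induction as enumerative search over a tiny, hypothetical DSL.
from itertools import product

PRIMITIVES = {
    "inc":    lambda x: x + 1,    # human-engineered primitives: the
    "double": lambda x: x * 2,    # "human-structured knowledge" the
    "square": lambda x: x * x,    # approach depends on
}

def induce(examples, max_depth=3):
    """Return the shortest primitive sequence whose composition fits every
    (input, output) example, or None. Search cost grows exponentially
    with depth -- the computational burden noted in the text."""
    for depth in range(1, max_depth + 1):
        for names in product(PRIMITIVES, repeat=depth):
            def run(x, names=names):
                for n in names:
                    x = PRIMITIVES[n](x)
                return x
            if all(run(i) == o for i, o in examples):
                return names
    return None

# Hypothetical target concept: f(x) = (x + 1) * 2
print(induce([(1, 4), (2, 6), (5, 12)]))  # ('inc', 'double')
```

The induced program is interpretable and reusable on new inputs, which is the upside the paper credits to this family of methods.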
Mitchell concludes with a set of proposals aimed at refining and directing future research in AI-focused abstraction and analogy-making:
- Focus on Idealized Domains: Idealized environments, stripped of extraneous real-world complexity, isolate the cognitive phenomena of interest and make it easier to evaluate whether a system is genuinely forming abstractions.
- Core Knowledge Prioritization: By concentrating on human core conceptual frameworks (e.g., spatial geometry, intuitive physics), AI can develop basic capabilities that are transferable and foundational to complex cognitive tasks.
- Robust Evaluation Metrics: Mitchell emphasizes that accuracy should not be the sole performance measure; robustness to noise and to increasing problem complexity is equally crucial.
- Generative and Minimal Training Tasks: Tasks should require little or no prior training and should emphasize generative problem-solving, so that systems are assessed on general problem-solving ability rather than on patterns absorbed through extensive task-specific training.
Mitchell argues persuasively that the path to human-level AI lies not in incremental improvements to existing methods but in a fundamental shift in how AI research approaches learning, abstraction, and analogy. Her recommendations for future research are intended to catalyze advances that close the capability gap between current AI systems and human cognitive processes. The paper stands as a compelling call to action, urging a strategic reorientation toward the grand challenge of general AI.