AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Published 29 Dec 2025 in cs.CL, cs.AI, and cs.CV | (2512.23343v1)

Abstract: Memory serves as the pivotal nexus bridging past and future, providing both humans and AI systems with invaluable concepts and experience to navigate complex tasks. Recent research on autonomous agents has increasingly focused on designing efficient memory workflows by drawing on cognitive neuroscience. However, constrained by interdisciplinary barriers, existing works struggle to assimilate the essence of human memory mechanisms. To bridge this gap, we systematically synthesizes interdisciplinary knowledge of memory, connecting insights from cognitive neuroscience with LLM-driven agents. Specifically, we first elucidate the definition and function of memory along a progressive trajectory from cognitive neuroscience through LLMs to agents. We then provide a comparative analysis of memory taxonomy, storage mechanisms, and the complete management lifecycle from both biological and artificial perspectives. Subsequently, we review the mainstream benchmarks for evaluating agent memory. Additionally, we explore memory security from dual perspectives of attack and defense. Finally, we envision future research directions, with a focus on multimodal memory systems and skill acquisition.

Abstract PDF Chat (Pro)

Summary

The paper presents a unified survey bridging cognitive neuroscience and LLM-driven agents through detailed memory systems analysis.
It examines diverse memory types, storage mechanisms, and dynamic management methods across biological and artificial domains.
The study highlights methods to overcome context limitations, enable long-term personalization, and mitigate security threats in AI systems.

Memory Systems in Cognitive Neuroscience and Autonomous Agents: A Unified Survey

Introduction

"AI Meets Brain: A Unified Survey on Memory Systems from Cognitive Neuroscience to Autonomous Agents" (2512.23343) presents a comprehensive synthesis of memory research across cognitive neuroscience and LLM-driven agent frameworks. The survey charts foundational definitions, taxonomies, storage and management mechanisms, benchmarking protocols, security considerations, and future research directions, emphasizing deep parallels and multidirectional inspirations between biological and artificial memory systems.

Definitions and Functions of Memory

The survey delineates memory from three perspectives: cognitive neuroscience, LLMs, and autonomous agents.

From a neuroscientific stance, memory is characterized as a cognitive process involving the encoding, storage, consolidation, and retrieval of information, supporting adaptivity, learning, and foresight in dynamic environments. LLM memory is dissected into three categories: parametric memory (internalized in model weights), working memory (context window), and explicit external memory (auxiliary storage mechanisms such as RAG). Autonomous agent memory is positioned as a dynamic system, transcending mere storage, now entailing structured storage, dynamic scheduling, and evolving cognitive processing.

Functional Utility of Agent Memory

Memory augments LLM-driven agents by addressing several critical limitations and aspirations:

Alleviating Context Window Constraints: Through structured information management -- both heuristic and learnable, e.g., context folding and reinforcement-optimized summarization -- infinite interaction histories are mapped into finite, efficient representations, reducing computational overhead and mitigating phenomena like "lost-in-the-middle".
Facilitating Long-term Personalization: Memory supports the formation of persistent, evolving user profiles and enables agents to align decision-making with user-specific preferences and historical behaviors, supporting both static and online adaptation.
Enabling Experience-based Reasoning: By instantiating procedural and strategic memory, agents execute both strategic guidance (retrieval of instructive precedents, trajectories, or guidelines) and procedural solidification (distillation of workflows, templates, or executable skill libraries), closing the gap between static LLMs and continually learning entities.
Figure 1: Memory utility in LLM-driven agents: overcoming context constraints, building long-term personalization, and enabling feedback-driven planning and reasoning.

Taxonomy of Memory Systems

A dual-axis classification is put forth:

Nature-based Taxonomy: Episodic memory (procedural, interactional, tool-augmented trajectories) vs. Semantic memory (fact-based, declarative, knowledge-centric content).
Scope-based Taxonomy: Inside-trail (ephemeral, within-episode) vs. Cross-trail (persistent, multi-session/trajectory) memory, corresponding to short-term and long-term dynamics.
Figure 2: (a) Nature-based taxonomy: episodic vs. semantic; (b) Scope-based taxonomy: inside-trail vs. cross-trail memory in agents.

Storage Mechanisms: Cognition and Computation

Memory storage is explored across two dimensions: location and format.

Neuroscience: Short-term memory leverages distributed sensory-frontoparietal networks and is realized via persistent neural activity or activity-silent synaptic states. Long-term memory depends on hippocampal-neocortical coordination, with consolidation and abstraction into event-based and cognitive map structures.
Artificial Agents: Storage locations include context windows for ephemeral information and external memory banks for persistent knowledge. Representational formats comprise text, graphs, parametric (model weight) embedding, and latent vector-based structures.
Figure 3: Memory storage in cognitive neuroscience: locations and formats for short- and long-term memory.

Memory Management

Both biological and artificial systems implement dynamic, closed-loop management frameworks:

Neuroscience: Cycles of memory formation (encoding, consolidation, integration), updating (prediction-error driven, differentiation/integration), and retrieval (cue-triggered, reconsolidation-enabled adaptive retrieval). Notably, retrieval itself reopens plasticity windows for trace modification.
Figure 4: Dynamic cycle of memory management in cognitive neuroscience—formation, updating, and retrieval fostering flexible adaptation.
Agents: Management encompasses extraction (flat, hierarchical, generative paradigms), updating (inside-trail vs. cross-trail), retrieval (similarity-based and multifactorial, including importance, recency, Q-value-based), and application (context augmentation and parameter distillation). Autonomous memory operations increasingly leverage RL and self-reflection mechanisms.
Figure 5: Memory management pipeline for agents: extraction, updating, retrieval, and utilization for persistent experience regulation.

Benchmarking Agent Memory

Benchmarks are categorized as:

Semantic-oriented: Measuring fidelity, memory dynamics, and generalization via tasks focusing on knowledge retention, dynamic updating, and abstraction-driven transfer (e.g., LoCoMo, MemBench, PersonaMem, HaluMem, LifelongAgentBench).
Episodic-oriented: Evaluating practical task performance in vertical domains where memory is critical -- web interaction (WebChoreArena, WebArena), tool-use (ToolBench, GAIA), and embodied environments (BabyAI, ScienceWorld, Mind2Web).

The survey stresses that generalized benchmarks probe the transformation from conversationalist to competent executor or problem-solver, with robust memory as a prerequisite.

Security Considerations

Memory in agents expands the attack surface for both extraction-based (privacy leakage) and poisoning-based (backdoor/data manipulation) attacks. Defense strategies span:

Retrieval-level: Purification, anomaly detection, structural consensus validation.
Response-level: Multi-agent collaborative review and reasoning trajectory rehearsal.
Privacy-level: Anonymization, partitioned workspaces, context integrity analysis.

The survey highlights that while retrieval-augmented systems enhance temporal adaptivity, they are susceptible to nuanced attack vectors not present in static parametric memory.

Future Directions

Two priority avenues are emphasized:

Multimodal Memory Systems: The survey identifies open problems in semantic consistency and alignment for non-textual modalities; solutions are emerging through compression, symbolic abstraction, and hybrid memory representations.
Agent Skills and Memory Sharing: Modular skills—encapsulating procedures, instructions, and knowledge—are posited as critical for composability and cross-agent transfer. There is a recognized need for universal representations and APIs to facilitate cross-modal, cross-model, and cross-agent memory portability and transfer.

Conclusion

This survey establishes a unified conceptual framework connecting neuroscientific and artificial memory research, advocating for reciprocal inspiration between fields. It systematically analyzes theoretical foundations, taxonomies, management architectures, evaluation protocols, security risks, and future challenges, advocating for robust, human-like memory mechanisms to further the development of adaptive, resilient, and generalizable AI systems. The prospects for multimodal and sharable memory modules remain central to advancing autonomous agents beyond the limits of current architectures (2512.23343).