- The paper presents a unified survey bridging cognitive neuroscience and LLM-driven agents through detailed memory systems analysis.
- It examines diverse memory types, storage mechanisms, and dynamic management methods across biological and artificial domains.
- The study highlights methods to overcome context limitations, enable long-term personalization, and mitigate security threats in AI systems.
Memory Systems in Cognitive Neuroscience and Autonomous Agents: A Unified Survey
Introduction
"AI Meets Brain: A Unified Survey on Memory Systems from Cognitive Neuroscience to Autonomous Agents" (2512.23343) presents a comprehensive synthesis of memory research across cognitive neuroscience and LLM-driven agent frameworks. The survey charts foundational definitions, taxonomies, storage and management mechanisms, benchmarking protocols, security considerations, and future research directions, emphasizing deep parallels and multidirectional inspirations between biological and artificial memory systems.
Definitions and Functions of Memory
The survey delineates memory from three perspectives: cognitive neuroscience, LLMs, and autonomous agents.
From a neuroscientific stance, memory is characterized as a cognitive process involving the encoding, storage, consolidation, and retrieval of information, supporting adaptivity, learning, and foresight in dynamic environments. LLM memory is dissected into three categories: parametric memory (internalized in model weights), working memory (context window), and explicit external memory (auxiliary storage mechanisms such as RAG). Autonomous agent memory is positioned as a dynamic system, transcending mere storage, now entailing structured storage, dynamic scheduling, and evolving cognitive processing.
Functional Utility of Agent Memory
Memory augments LLM-driven agents by addressing several critical limitations and aspirations:
- Alleviating Context Window Constraints: Through structured information management -- both heuristic and learnable, e.g., context folding and reinforcement-optimized summarization -- infinite interaction histories are mapped into finite, efficient representations, reducing computational overhead and mitigating phenomena like "lost-in-the-middle".
- Facilitating Long-term Personalization: Memory supports the formation of persistent, evolving user profiles and enables agents to align decision-making with user-specific preferences and historical behaviors, supporting both static and online adaptation.
- Enabling Experience-based Reasoning: By instantiating procedural and strategic memory, agents execute both strategic guidance (retrieval of instructive precedents, trajectories, or guidelines) and procedural solidification (distillation of workflows, templates, or executable skill libraries), closing the gap between static LLMs and continually learning entities.
Figure 1: Memory utility in LLM-driven agents: overcoming context constraints, building long-term personalization, and enabling feedback-driven planning and reasoning.
Taxonomy of Memory Systems
A dual-axis classification is put forth:
Storage Mechanisms: Cognition and Computation
Memory storage is explored across two dimensions: location and format.
Memory Management
Both biological and artificial systems implement dynamic, closed-loop management frameworks:
- Neuroscience: Cycles of memory formation (encoding, consolidation, integration), updating (prediction-error driven, differentiation/integration), and retrieval (cue-triggered, reconsolidation-enabled adaptive retrieval). Notably, retrieval itself reopens plasticity windows for trace modification.
Figure 4: Dynamic cycle of memory management in cognitive neuroscience—formation, updating, and retrieval fostering flexible adaptation.
- Agents: Management encompasses extraction (flat, hierarchical, generative paradigms), updating (inside-trail vs. cross-trail), retrieval (similarity-based and multifactorial, including importance, recency, Q-value-based), and application (context augmentation and parameter distillation). Autonomous memory operations increasingly leverage RL and self-reflection mechanisms.
Figure 5: Memory management pipeline for agents: extraction, updating, retrieval, and utilization for persistent experience regulation.
Benchmarking Agent Memory
Benchmarks are categorized as:
- Semantic-oriented: Measuring fidelity, memory dynamics, and generalization via tasks focusing on knowledge retention, dynamic updating, and abstraction-driven transfer (e.g., LoCoMo, MemBench, PersonaMem, HaluMem, LifelongAgentBench).
- Episodic-oriented: Evaluating practical task performance in vertical domains where memory is critical -- web interaction (WebChoreArena, WebArena), tool-use (ToolBench, GAIA), and embodied environments (BabyAI, ScienceWorld, Mind2Web).
The survey stresses that generalized benchmarks probe the transformation from conversationalist to competent executor or problem-solver, with robust memory as a prerequisite.
Security Considerations
Memory in agents expands the attack surface for both extraction-based (privacy leakage) and poisoning-based (backdoor/data manipulation) attacks. Defense strategies span:
- Retrieval-level: Purification, anomaly detection, structural consensus validation.
- Response-level: Multi-agent collaborative review and reasoning trajectory rehearsal.
- Privacy-level: Anonymization, partitioned workspaces, context integrity analysis.
The survey highlights that while retrieval-augmented systems enhance temporal adaptivity, they are susceptible to nuanced attack vectors not present in static parametric memory.
Future Directions
Two priority avenues are emphasized:
- Multimodal Memory Systems: The survey identifies open problems in semantic consistency and alignment for non-textual modalities; solutions are emerging through compression, symbolic abstraction, and hybrid memory representations.
- Agent Skills and Memory Sharing: Modular skills—encapsulating procedures, instructions, and knowledge—are posited as critical for composability and cross-agent transfer. There is a recognized need for universal representations and APIs to facilitate cross-modal, cross-model, and cross-agent memory portability and transfer.
Conclusion
This survey establishes a unified conceptual framework connecting neuroscientific and artificial memory research, advocating for reciprocal inspiration between fields. It systematically analyzes theoretical foundations, taxonomies, management architectures, evaluation protocols, security risks, and future challenges, advocating for robust, human-like memory mechanisms to further the development of adaptive, resilient, and generalizable AI systems. The prospects for multimodal and sharable memory modules remain central to advancing autonomous agents beyond the limits of current architectures (2512.23343).