
Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning (1904.03137v4)

Published 5 Apr 2019 in cs.NE, cs.CV, and cs.LG

Abstract: Models trained in the context of continual learning (CL) should be able to learn from a stream of data over an undefined period of time. The main challenges herein are: 1) maintaining old knowledge while simultaneously benefiting from it when learning new tasks, and 2) guaranteeing model scalability with a growing amount of data to learn from. In order to tackle these challenges, we introduce Dynamic Generative Memory (DGM) - a synaptic plasticity driven framework for continual learning. DGM relies on conditional generative adversarial networks with learnable connection plasticity realized with neural masking. Specifically, we evaluate two variants of neural masking: applied to (i) layer activations and (ii) to connection weights directly. Furthermore, we propose a dynamic network expansion mechanism that ensures sufficient model capacity to accommodate for continually incoming tasks. The amount of added capacity is determined dynamically from the learned binary mask. We evaluate DGM in the continual class-incremental setup on visual classification tasks.

Citations (269)

Summary

  • The paper introduces a novel framework driven by synaptic plasticity that integrates dynamic generative memory with adaptive network expansion for continual learning.
  • It employs neural masking on both layer activations and connection weights to selectively retain prior knowledge while incorporating new data.
  • Experimental results in visual classification tasks validate the approach by demonstrating effective balance between memory retention and new task adaptation.

Overview of "Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning"

The paper "Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning" presents a novel approach to addressing challenges inherent in Continual Learning (CL). The primary aim of CL is to enable models to learn from a continuous stream of data over indefinite periods, thereby maintaining relevance as new information becomes available. Two significant hurdles are preserving knowledge of past tasks while leveraging it for new learning and ensuring scalability as data volumes increase.

Dynamic Generative Memory

The authors propose Dynamic Generative Memory (DGM), a framework driven by synaptic plasticity principles to tackle these issues. DGM introduces a mechanism built upon conditional generative adversarial networks (GANs) augmented with learnable connection plasticity via neural masking. This approach is designed to manage continual learning more effectively by balancing the retention and integration of knowledge. The paper explores two variants of neural masking: one applied to layer activations and another directly to connection weights. These mechanisms allow for selective modulation of neural paths, thereby facilitating the retention of previous knowledge and the incorporation of new data.
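To make the two masking variants concrete, the sketch below shows how a per-task binary mask could gate either a layer's activations or its connection weights. This is a minimal illustration under assumed names (`MaskedLinear`, `mask_mode`, the straight-through binarization), not the authors' actual implementation.

```python
import torch
import torch.nn as nn


class MaskedLinear(nn.Module):
    """Linear layer with a learnable per-task binary mask (illustrative sketch).

    `mask_mode="unit"` masks the layer's output activations;
    `mask_mode="weight"` masks the connection weights directly,
    mirroring the two DGM variants described above.
    """

    def __init__(self, in_features, out_features, num_tasks, mask_mode="unit"):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        self.mask_mode = mask_mode
        if mask_mode == "unit":
            # one real-valued mask logit per output unit, per task
            self.mask_logits = nn.Parameter(torch.zeros(num_tasks, out_features))
        else:
            # one mask logit per connection weight, per task
            self.mask_logits = nn.Parameter(torch.zeros(num_tasks, out_features, in_features))

    def binary_mask(self, task_id, threshold=0.0):
        logits = self.mask_logits[task_id]
        hard = (logits > threshold).float()
        # straight-through estimator: binary values forward, sigmoid gradient backward
        return hard + torch.sigmoid(logits) - torch.sigmoid(logits).detach()

    def forward(self, x, task_id):
        m = self.binary_mask(task_id)
        if self.mask_mode == "unit":
            return self.linear(x) * m            # mask layer activations
        masked_weight = self.linear.weight * m   # mask connection weights
        return nn.functional.linear(x, masked_weight, self.linear.bias)
```

In a usage such as `layer = MaskedLinear(128, 64, num_tasks=5, mask_mode="weight")` followed by `y = layer(x, task_id=2)`, the mask logits for each task would be learned jointly with the generator's weights, so earlier tasks' paths can be frozen while new tasks claim free capacity.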

Network Expansion

To keep learning scalable, the paper introduces dynamic network expansion. This mechanism lets the model regulate its own capacity in response to the complexity of incoming tasks: the extent of expansion is determined from the learned binary mask, so that capacity grows with task demands without unnecessary resource overhead.
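A hedged sketch of how the expansion size might be derived from the learned mask: count the units already reserved by any previous task and grow the layer so that a fixed fraction remains free for the next one. The `target_free_ratio` heuristic is an assumed placeholder, not the paper's exact rule.

```python
import torch


def expansion_size(mask_logits: torch.Tensor,
                   threshold: float = 0.0,
                   target_free_ratio: float = 0.5) -> int:
    """Hypothetical capacity heuristic for dynamic expansion.

    `mask_logits` has shape (num_tasks_so_far, num_units), as in the
    unit-masking sketch above. Units claimed by any previous task count
    as reserved; the layer is grown until `target_free_ratio` of its
    units are free for the next task.
    """
    reserved = int((mask_logits > threshold).any(dim=0).sum())
    total = mask_logits.shape[-1]
    free = total - reserved
    needed = int(target_free_ratio * total)
    return max(0, needed - free)  # number of units to add before the next task
```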

Experimental Results

The experimental evaluation demonstrates the efficacy of DGM in continual class-incremental learning scenarios, particularly on visual classification tasks. The results indicate that DGM effectively balances retention of prior tasks with integration of new inputs, a notable challenge in CL, and suggest that its dynamic adaptation strategies sustain model performance over long sequences of tasks.

Implications and Future Directions

This research marks significant progress in the field of continual learning, proposing a framework that can potentially be adapted and scaled across domains that require uninterrupted, long-term model training. The flexibility of DGM's neural masking and dynamic expansion suggests avenues for future work, notably extending the framework to more complex and varied datasets. While the paper demonstrates strong performance on visual tasks, future research could explore adapting DGM to other data modalities. Further theoretical exploration of the interplay between synaptic plasticity and neural masking could yield deeper insights into the potential of biologically inspired methods in artificial intelligence.

In conclusion, "Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning" makes substantial contributions to the domain of CL, providing an innovative approach that balances the retention of prior knowledge with the integration of new information and dynamic adaptation. While the paper focuses on visual classification tasks, the broader impacts of this framework offer promising directions for further exploration in scalable, real-world applications.