Reinforced Continual Learning: An Overview
The paper "Reinforced Continual Learning" by Ju Xu and Zhanxing Zhu addresses the critical challenge of addressing catastrophic forgetting in artificial intelligence models when learning sequential tasks. Continual learning, also termed lifelong learning, aims to enable models to learn consecutive tasks while retaining previously acquired knowledge. Traditional approaches have encountered challenges in maintaining the balance between learning new tasks effectively and preserving old task knowledge.
Methodology
This work introduces a novel approach named Reinforced Continual Learning (RCL), which leverages reinforcement learning to adjust the neural network architecture dynamically. Unlike conventional methods that rely on static architectures or naive expansion strategies, RCL employs a reinforcement-learning-based controller to expand the model selectively. The expansion decision is framed as a combinatorial optimization problem and solved with an actor-critic approach, guided by a reward signal that combines validation accuracy and model complexity.
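To make the reward structure concrete, the following sketch scalarizes validation accuracy and a parameter-count penalty into a single reward and forms an actor-critic style advantage. The function names, the linear weighting, and the use of a raw parameter count are illustrative assumptions, not the paper's exact formulation.

```python
# Minimal sketch of a reward balancing accuracy against added complexity.
# The weighting scheme and complexity measure are assumptions for illustration.

def expansion_reward(val_accuracy: float, num_added_params: int,
                     complexity_weight: float = 1e-6) -> float:
    """Reward the controller for validation accuracy, penalizing newly added parameters."""
    return val_accuracy - complexity_weight * num_added_params

def advantage(reward: float, value_estimate: float) -> float:
    """Actor-critic style advantage: reward minus the critic's value estimate."""
    return reward - value_estimate
```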
RCL's innovation lies in its ability to determine, for each new task, how many nodes or filters to add. The controller, implemented as a Long Short-Term Memory (LSTM) network, generates these architectural modifications, striking a balance between preventing catastrophic forgetting and maintaining computational efficiency. The approach is evaluated on sequential-task benchmarks built from MNIST and CIFAR-100 variants, and the authors report that RCL outperforms existing continual learning methods, achieving better test accuracy with lower model complexity.
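The sketch below shows, under stated assumptions, what such an LSTM controller might look like in PyTorch: for each layer of the task network it samples how many units or filters to add and returns the summed log-probabilities needed for a policy-gradient update. The layer count, action-space bound, hidden size, and the placeholder reward and baseline values are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch (not the authors' code) of an LSTM controller that,
# for each layer of the task network, samples how many units/filters to add.
import torch
import torch.nn as nn

class ExpansionController(nn.Module):
    def __init__(self, num_layers: int, max_new_units: int, hidden_size: int = 64):
        super().__init__()
        self.num_layers = num_layers
        self.lstm = nn.LSTMCell(hidden_size, hidden_size)
        self.embedding = nn.Embedding(max_new_units + 1, hidden_size)
        self.head = nn.Linear(hidden_size, max_new_units + 1)  # actions 0..max_new_units

    def sample(self):
        """Sample one expansion decision per layer; return actions and summed log-probs."""
        h = torch.zeros(1, self.lstm.hidden_size)
        c = torch.zeros(1, self.lstm.hidden_size)
        inp = self.embedding(torch.zeros(1, dtype=torch.long))  # start token
        actions, log_probs = [], []
        for _ in range(self.num_layers):
            h, c = self.lstm(inp, (h, c))
            dist = torch.distributions.Categorical(logits=self.head(h))
            action = dist.sample()
            actions.append(int(action))
            log_probs.append(dist.log_prob(action))
            inp = self.embedding(action)  # feed the last decision back in
        return actions, torch.stack(log_probs).sum()

# Usage sketch with placeholder numbers: after training/evaluating the expanded
# network (omitted), reinforce the controller with the observed advantage.
controller = ExpansionController(num_layers=3, max_new_units=10)
actions, log_prob = controller.sample()
reward, baseline = 0.90, 0.85          # placeholders for reward and critic value
loss = -(reward - baseline) * log_prob  # policy-gradient loss with a baseline
loss.backward()
```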
Results
The empirical evaluation demonstrates RCL's superiority over existing methods in both average test accuracy and model complexity across all datasets. In particular, RCL showed notable parameter reduction compared to Progressive Networks and Dynamically Expandable Networks (DEN) while maintaining competitive test accuracy. The paper provides insightful comparisons against both static and expandable network-based continual learning approaches.
RCL demonstrates a significant reduction in model parameters on the CIFAR-100 dataset, reducing them by 42% and 53% relative to Progressive Networks and DEN, respectively. Additionally, the authors provide evidence that the policy-based expansion strategy not only prevents forgetting more effectively but also achieves a better balance between performance and computational burden than previous methods.
Implications and Future Directions
The introduction of reinforcement learning into the architecture expansion decision-making process opens new avenues for optimizing neural networks in continual learning scenarios. The adaptive architectural adjustments made by RCL could inspire future research on backward transfer, where learning new tasks improves performance on previously learned ones. At the same time, using reinforcement learning in this setting raises questions about computational efficiency, warranting further work on reducing training time, particularly for larger networks and more complex tasks.
Future research could extend RCL's framework to various types of neural network architectures, explore its applicability across different domains of artificial intelligence tasks, and refine the reinforcement learning-based policy to further improve learning efficiency and scalability.
This paper's methodological contribution, dynamic network expansion driven by reinforcement learning, offers a refreshing perspective on building robust and efficient continual learning systems. It may inform advances in artificial intelligence applications where sequential learning with minimal resource expenditure is critical.