- The paper presents a unified meta-learning approach, Variable Shared Meta Learning (VSML), that leverages sparsity and weight-sharing to express reusable neural learning algorithms.
- The method replaces each conventional weight with a tiny LSTM whose parameters are shared across connections, enabling adaptive learning without explicit gradient computation.
- VSML meta-learns algorithms that generalize across benchmarks such as MNIST and Fashion MNIST, highlighting its potential to reduce the hand-designed components in learning-algorithm design.
Analysis of "Meta Learning Backpropagation And Improving It"
The paper by Kirsch and Schmidhuber introduces a novel meta-learning approach focused on learning neural network learning algorithms that generalize beyond their training environments. The proposed method, Variable Shared Meta Learning (VSML), leverages sparsity and weight-sharing within neural networks to express powerful learning algorithms that can be reused across tasks. The work aligns with a broader trend in meta-learning research: reducing the need for hand-crafted learning algorithms in favor of learned strategies that adapt to varied datasets and environments.
Key Contributions
- Unified Framework for Meta Learning: VSML unifies several concepts such as fast weights, learned learning rules, and Hebbian plasticity, establishing a coherent framework for meta-learning. The paper argues that simple weight-sharing in neural networks suffices to facilitate powerful, reusable learning mechanisms that are distinct from traditional backpropagation-based approaches.
- Adaptive Learning Algorithms: The authors replace the conventional weights of a neural network with tiny long short-term memory networks (LSTMs) whose parameters are shared, running in forward mode only. The system can meta-learn algorithms that operate differently from online backpropagation and generalize to data outside the meta-training distribution without computing explicit gradients.
- Practical Demonstration of VSML: The paper evaluates VSML's ability to meta-learn learning algorithms from scratch, showing that the resulting learners can be more sample-efficient than standard gradient descent on the evaluated tasks. The experiments also demonstrate transfer of the meta-learned algorithms across datasets such as MNIST and Fashion MNIST.
- Introspection into Meta-Learned Algorithms: The authors present introspective analyses revealing that the meta-learned algorithms in VSML facilitate learning through fast associations, a learning behavior qualitatively distinct from that of gradient descent methods.
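The weight-sharing mechanism described above can be made concrete with a toy sketch: every scalar weight of a layer is replaced by a small per-connection state vector, and all connections are updated by one tiny shared recurrent cell during the forward pass, so "learning" lives in the evolving states rather than in fixed weights. This is a simplified illustration of the idea, not the authors' exact VSML architecture (which uses small LSTMs with learned forward and backward message passing); the cell type, state size, and parameter names here are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Parameters of one tiny recurrent cell, shared by every (i, j) connection.
# These few shared parameters play the role that per-weight values play in
# an ordinary layer; adaptation happens only in the per-connection states.
STATE = 4                                  # size of each connection's state
Wh = rng.normal(0, 0.5, (STATE, STATE))    # state -> state (shared)
Wx = rng.normal(0, 0.5, (STATE,))          # scalar input -> state (shared)
Wo = rng.normal(0, 0.5, (STATE,))          # state -> scalar message (shared)

def init_states(n_in, n_out):
    """One small state vector per connection, laid out like a weight matrix."""
    return np.zeros((n_in, n_out, STATE))

def forward(states, x):
    """Update every connection's state with the shared cell and emit messages.

    states: (n_in, n_out, STATE), x: (n_in,)
    Returns the updated states and the layer output of shape (n_out,).
    """
    # Connection (i, j) sees input x[i]; all connections reuse Wh, Wx, Wo.
    new_states = np.tanh(states @ Wh.T + x[:, None, None] * Wx)
    messages = new_states @ Wo               # one scalar message per connection
    return new_states, messages.sum(axis=0)  # aggregate over inputs, like y = W^T x

states = init_states(3, 2)
x = np.array([1.0, -0.5, 0.25])
for _ in range(5):                           # states evolve across forward steps
    states, y = forward(states, x)
```

Because only the states change from step to step while the cell parameters stay fixed, the same few meta-learned parameters can be reused at every connection of any layer size, which is what makes the learned algorithm reusable across tasks.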
Results and Implications
The experimental results underscore VSML's ability to learn algorithms that generalize broadly, even to datasets substantially different from those seen during meta-training. They suggest that VSML can reduce reliance on human-designed components such as backpropagation, leading to more adaptive and resilient learning systems.
Future Prospects
Several promising directions for future research arise from this work. Scaling VSML to more complex tasks and larger datasets could extend its impact to reinforcement learning. Refining meta-learning strategies to exploit the weight-sharing and sparsity principle further remains an open avenue. Given the growing interest in memory efficiency and sample efficiency, especially in continual learning settings, VSML offers a promising direction for research.
In conclusion, this paper marks a significant step toward adaptive, general-purpose learning systems by equipping them with the capacity to meta-learn effective learning algorithms without a fixed reliance on traditional backpropagation. As the field matures, methods like VSML promise to change how learning systems are developed and deployed, embodying a shift toward systems that learn to learn across dynamic and diverse environments.