Meta-Learning through Hebbian Plasticity in Random Networks
The paper "Meta-Learning through Hebbian Plasticity in Random Networks" by Elias Najarro and Sebastian Risi addresses the challenge of adaptability in reinforcement learning (RL). The work starts from the premise that while RL agents have shown impressive capabilities on complex tasks, their policies are typically static after training, lacking the dynamic adaptability inherent in biological organisms. Drawing inspiration from biological neural mechanisms, particularly synaptic plasticity, the authors propose a method that meta-learns Hebbian learning rules to enable lifelong adaptation in neural-network agents.
Overview of the Approach
The proposed methodology diverges from conventional RL paradigms, which optimize a fixed set of network weights. Instead, it uses an evolutionary search to discover synapse-specific Hebbian learning rules that drive continuous weight adaptation throughout an agent's lifetime. The core idea is to initialize a network with random synaptic weights and let local Hebbian rules self-organize those weights in response to sensory input, without any explicit reward signal during the agent's lifetime.
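The per-synapse update described above can be sketched as a generalized Hebbian "ABCD" rule, in which every connection carries its own coefficients combining a correlation term, pre- and post-synaptic terms, and a constant. The snippet below is a minimal sketch under illustrative assumptions, not the paper's implementation: the layer sizes, coefficient ranges, and single-layer structure are all placeholders, and the coefficients here are random rather than meta-learned.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical layer sizes for a single fully connected layer.
n_in, n_out = 4, 3

# Random initial weights -- never optimized directly.
W = rng.uniform(-0.1, 0.1, size=(n_out, n_in))

# Per-synapse coefficients (eta, A, B, C, D). In the paper these are the
# parameters discovered by the outer-loop search; here they are random.
eta = rng.uniform(-0.1, 0.1, size=(n_out, n_in))
A, B, C, D = (rng.uniform(-1.0, 1.0, size=(n_out, n_in)) for _ in range(4))

def step(x, W):
    """One forward pass followed by a local Hebbian weight update."""
    y = np.tanh(W @ x)          # post-synaptic activations
    pre = x[np.newaxis, :]      # pre-synaptic values, broadcast across rows
    post = y[:, np.newaxis]     # post-synaptic values, broadcast across columns
    # Generalized ABCD rule: correlation, pre-only, post-only, and bias terms.
    dW = eta * (A * pre * post + B * pre + C * post + D)
    return y, W + dW

# The weights keep changing at every timestep of the agent's "lifetime".
x = rng.normal(size=n_in)
for _ in range(10):
    y, W = step(x, W)
```

Because the update uses only quantities local to each synapse (its pre- and post-synaptic activations), it can run at every timestep without any gradient or reward signal.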
The paper demonstrates the approach on RL tasks with different sensory modalities, in particular a 2D-pixel environment and a 3D quadrupedal locomotion task. Notably, in the 3D robot domain the Hebbian networks can adapt to morphological damage not seen during training within roughly 100 timesteps, showcasing an impressive level of adaptability and robustness.
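The Hebbian coefficients themselves are found by a population-based evolutionary search in an outer loop. The sketch below shows the general shape of such an evolution-strategy loop; it is a toy illustration, not the paper's setup: the quadratic `fitness` is a stand-in for an episode rollout (in the paper, `theta` would parameterize the per-synapse Hebbian coefficients and fitness would be the cumulative episode reward), and all sizes and hyperparameters are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

n_params = 5 * 12            # illustrative: 5 coefficients for each of 12 synapses
sigma, lr, pop = 0.1, 0.05, 16

def fitness(theta):
    """Stand-in objective. In the actual method this would run a full
    episode with a plastic network governed by theta and return reward."""
    return -np.sum((theta - 0.5) ** 2)

theta = np.zeros(n_params)
for gen in range(200):
    eps = rng.normal(size=(pop, n_params))                 # perturbations
    f = np.array([fitness(theta + sigma * e) for e in eps])
    f = (f - f.mean()) / (f.std() + 1e-8)                  # normalize fitness
    theta += lr / (pop * sigma) * eps.T @ f                # ES gradient estimate
```

The key point is the division of labor: the outer loop optimizes the learning rules across episodes, while the inner Hebbian dynamics adapt the weights within each episode.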
Key Findings and Results
A central result is that the Hebbian networks consistently reach high performance starting from random initial weights, with the discovered learning rules enabling dynamic self-organization reminiscent of biological networks. In the vision-based CarRacing-v0 environment, for instance, the Hebbian networks achieved cumulative rewards on par with several state-of-the-art deep RL methods, with performance evaluated over 100 test rollouts.
In the 3D locomotion task, the Hebbian networks adapted to damaged morphologies not seen during training, a scenario in which traditional static-weight networks failed. The authors attribute this adaptability to an emergent attractor in weight phase-space that guides the weights toward an effective configuration regardless of their initial values.
Implications and Future Directions
This research has implications for both practical applications and theoretical advancements. Practically, agents that adapt in real time to environmental changes without retraining are better suited for deployment in dynamic real-world environments. Theoretically, the findings advance our understanding of how principles of neural plasticity can be translated into artificial systems, potentially mirroring adaptive mechanisms observed in biological brains.
The paper lays a foundation for future exploration of more complex neuromodulation strategies, which could further enhance the adaptability and performance of RL agents. Evolving not only the learning rules but also the network architecture, and incorporating indirect genotype-to-phenotype mappings, could yield more efficient and more biologically plausible models.
Overall, "Meta-Learning through Hebbian Plasticity in Random Networks" opens promising avenues for the development of RL systems that more closely mirror the adaptability and learning efficiency of biological organisms. The integration of Hebbian principles presents a nuanced perspective in tackling the enduring challenges of lifelong learning and autonomous adaptability in artificial intelligence.