Reconciling meta-learning and continual learning with online mixtures of tasks (1812.06080v3)
Abstract: Learning-to-learn or meta-learning leverages data-driven inductive bias to increase the efficiency of learning on a novel task. This approach encounters difficulty when transfer is not advantageous, for instance, when tasks are considerably dissimilar or change over time. We use the connection between gradient-based meta-learning and hierarchical Bayes to propose a Dirichlet process mixture of hierarchical Bayesian models over the parameters of an arbitrary parametric model such as a neural network. In contrast to consolidating inductive biases into a single set of hyperparameters, our approach of task-dependent hyperparameter selection better handles latent distribution shift, as demonstrated on a set of evolving, image-based, few-shot learning benchmarks.
- Ghassen Jerfel
- Erin Grant
- Thomas L. Griffiths
- Katherine Heller
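To make the abstract's central idea concrete — selecting task-dependent hyperparameters from a growing mixture instead of consolidating everything into one set — here is a minimal, self-contained sketch on toy 1-D regression. It is not the authors' algorithm: the paper infers cluster assignments under a Dirichlet process prior with variational methods and meta-updates via MAML-style gradients, whereas this toy uses hard CRP-style assignment scored by post-adaptation loss and a first-order (Reptile-style) outer update. All names and constants (`inner_lr`, `alpha`, the two-regime task stream, etc.) are illustrative assumptions.

```python
# Sketch (not the authors' code): a Chinese-restaurant-process mixture of
# meta-learned initializations for toy linear-regression tasks whose slope
# alternates between two regimes (a stand-in for latent distribution shift).
import numpy as np

rng = np.random.default_rng(0)

def inner_adapt(theta, X, y, inner_lr=0.1, steps=5):
    """A few gradient steps of squared-error loss from initialization theta."""
    w = theta.copy()
    for _ in range(steps):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w -= inner_lr * grad
    return w

def task_loss(w, X, y):
    return float(np.mean((X @ w - y) ** 2))

components = [rng.normal(size=2)]   # one initial meta-initialization (slope, bias)
counts = [0]                        # tasks assigned to each component so far
alpha = 1.0                         # CRP concentration: propensity to spawn clusters
outer_lr = 0.05

for t in range(50):
    # Sample a task from one of two dissimilar regimes.
    slope = 3.0 if t % 2 == 0 else -3.0
    X = np.column_stack([rng.uniform(-1, 1, 20), np.ones(20)])
    y = slope * X[:, 0] + rng.normal(scale=0.1, size=20)

    # Score each existing component by post-adaptation loss (a crude proxy for
    # the marginal likelihood used in the paper) plus a CRP-style log prior.
    scores = [task_loss(inner_adapt(th, X, y), X, y) - np.log(c + 1)
              for th, c in zip(components, counts)]
    fresh = rng.normal(size=2)      # candidate initialization for a new cluster
    scores.append(task_loss(inner_adapt(fresh, X, y), X, y) - np.log(alpha))

    k = int(np.argmin(scores))
    if k == len(components):        # dissimilar task: spawn a new component
        components.append(fresh)
        counts.append(0)
    counts[k] += 1

    # Outer update: move only the selected component toward the adapted solution
    # (a first-order, Reptile-style stand-in for the MAML meta-gradient).
    adapted = inner_adapt(components[k], X, y)
    components[k] += outer_lr * (adapted - components[k])

print(f"{len(components)} components discovered; assignments per component: {counts}")
```

Running this, the two slope regimes typically settle into separate components, so adaptation for each task starts from an initialization tuned to its own regime — the abstract's point that task-dependent hyperparameter selection avoids forcing transfer between considerably dissimilar tasks.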