A general framework for ensemble distribution distillation (2002.11531v2)

Published 26 Feb 2020 in stat.ML and cs.LG

Abstract: Ensembles of neural networks have been shown to give better performance than single networks, both in terms of predictions and uncertainty estimation. Additionally, ensembles allow the uncertainty to be decomposed into aleatoric (data) and epistemic (model) components, giving a more complete picture of the predictive uncertainty. Ensemble distillation is the process of compressing an ensemble into a single model, often resulting in a leaner model that still outperforms the individual ensemble members. Unfortunately, standard distillation erases the natural uncertainty decomposition of the ensemble. We present a general framework for distilling both regression and classification ensembles in a way that preserves the decomposition. We demonstrate the desired behaviour of our framework and show that its predictive performance is on par with standard distillation.
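For context on the decomposition the abstract refers to, below is a minimal sketch of the standard entropy-based split of a classification ensemble's predictive uncertainty into aleatoric and epistemic parts (total = entropy of the averaged prediction, aleatoric = average member entropy, epistemic = the difference). This is the generic decomposition that standard distillation erases, not the paper's general framework; the function name and example values are illustrative assumptions.

```python
import numpy as np

def decompose_uncertainty(ensemble_probs):
    """Entropy-based uncertainty decomposition for one input.

    ensemble_probs: array of shape (M, K) holding the predicted class
    probabilities of M ensemble members over K classes.
    Returns (total, aleatoric, epistemic) in nats.
    """
    eps = 1e-12  # guard against log(0)
    mean_probs = ensemble_probs.mean(axis=0)
    # Total uncertainty: entropy of the ensemble-averaged prediction.
    total = -np.sum(mean_probs * np.log(mean_probs + eps))
    # Aleatoric (data) uncertainty: average entropy of individual members.
    aleatoric = -np.mean(np.sum(ensemble_probs * np.log(ensemble_probs + eps), axis=1))
    # Epistemic (model) uncertainty: mutual information between the
    # prediction and the choice of ensemble member.
    epistemic = total - aleatoric
    return total, aleatoric, epistemic

# Example: three members that disagree, so epistemic uncertainty is non-zero.
probs = np.array([[0.9, 0.1],
                  [0.5, 0.5],
                  [0.2, 0.8]])
print(decompose_uncertainty(probs))
```

A single distilled network that only matches the averaged probabilities collapses this split, which is the motivation for distilling the full ensemble distribution instead.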

Authors (4)
  1. Jakob Lindqvist (5 papers)
  2. Amanda Olmin (5 papers)
  3. Fredrik Lindsten (69 papers)
  4. Lennart Svensson (81 papers)
Citations (18)
