Multi-headed Neural Ensemble Search (2107.04369v1)

Published 9 Jul 2021 in cs.LG and stat.ML

Abstract: Ensembles of CNN models trained with different seeds (also known as Deep Ensembles) are known to achieve superior performance over a single copy of the CNN. Neural Ensemble Search (NES) can further boost performance by adding architectural diversity. However, the computational cost of NES remains prohibitive under limited resources. In this work, we extend NES to multi-headed ensembles, which consist of a shared backbone attached to multiple prediction heads. Unlike Deep Ensembles, these multi-headed ensembles can be trained end to end, which enables us to leverage one-shot NAS methods to optimize an ensemble objective. With extensive empirical evaluations, we demonstrate that multi-headed ensemble search finds robust ensembles three times faster, while matching other ensemble search methods in both predictive performance and uncertainty calibration.
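The core architectural idea from the abstract (a shared backbone feeding multiple prediction heads, whose outputs are averaged into an ensemble prediction) can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: a single linear map stands in for the CNN backbone, and all weight names and dimensions are assumptions chosen for the example.

```python
import numpy as np

def softmax(z):
    """Row-wise softmax with the standard max-subtraction for stability."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
input_dim, feat_dim, num_classes, num_heads = 32, 16, 10, 3

# Shared backbone (a linear stand-in for the CNN trunk in the paper).
W_backbone = rng.standard_normal((input_dim, feat_dim))
# Independent prediction heads attached to the shared features; training
# end to end would backpropagate an ensemble objective (e.g. the average
# of the per-head losses) through both the heads and the backbone.
W_heads = [rng.standard_normal((feat_dim, num_classes))
           for _ in range(num_heads)]

x = rng.standard_normal((4, input_dim))   # batch of 4 inputs
features = np.tanh(x @ W_backbone)        # shared representation
head_probs = [softmax(features @ W) for W in W_heads]
# Ensemble prediction: average the per-head class probabilities.
ensemble_probs = np.mean(head_probs, axis=0)
```

Because the heads share one backbone, a forward pass costs roughly one network evaluation plus a few cheap head evaluations, which is what makes the end-to-end, one-shot search tractable compared with training separate ensemble members.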

Authors (5)
  1. Ashwin Raaghav Narayanan (1 paper)
  2. Arber Zela (22 papers)
  3. Tonmoy Saikia (8 papers)
  4. Thomas Brox (134 papers)
  5. Frank Hutter (177 papers)
Citations (4)
