Epistemic Neural Networks (2107.08924v8)

Published 19 Jul 2021 in cs.LG, cs.AI, and stat.ML

Abstract: Intelligence relies on an agent's knowledge of what it does not know. This capability can be assessed based on the quality of joint predictions of labels across multiple inputs. In principle, ensemble-based approaches produce effective joint predictions, but the computational costs of training large ensembles can become prohibitive. We introduce the epinet: an architecture that can supplement any conventional neural network, including large pretrained models, and can be trained with modest incremental computation to estimate uncertainty. With an epinet, conventional neural networks outperform very large ensembles, consisting of hundreds or more particles, with orders of magnitude less computation. The epinet does not fit the traditional framework of Bayesian neural networks. To accommodate development of approaches beyond BNNs, such as the epinet, we introduce the epistemic neural network (ENN) as an interface for models that produce joint predictions.

Citations (90)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/MarkSchmidtUBC/status/1902786064231846260

https://twitter.com/22146921/status/1736504934093164689

https://twitter.com/balajiln/status/1560007888881344512

Epistemic Neural Networks (2107.08924v8)

Summary

Related Papers

Tweets