Uncertainty in Neural Networks: Approximately Bayesian Ensembling (1810.05546v5)

Published 12 Oct 2018 in stat.ML and cs.LG

Abstract: Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesian. This work proposes one modification to the usual process that we argue does result in approximate Bayesian inference; regularising parameters about values drawn from a distribution which can be set equal to the prior. A theoretical analysis of the procedure in a simplified setting suggests the recovered posterior is centred correctly but tends to have an underestimated marginal variance, and overestimated correlation. However, two conditions can lead to exact recovery. We argue that these conditions are partially present in NNs. Empirical evaluations demonstrate it has an advantage over standard ensembling, and is competitive with variational methods.

Citations (181)

Summary

  • The paper’s main contribution is the introduction of anchored ensembling (Randomised MAP Sampling) to approximate Bayesian inference in neural networks.
  • The methodology regularizes network parameters around prior-sampled values to align ensemble predictions with Gaussian Process behaviors under specific conditions.
  • Empirical results across regression, classification, sentiment analysis, and reinforcement learning tasks demonstrate robust uncertainty estimation and state-of-the-art performance.

Insights into Approximately Bayesian Ensembling for Neural Networks

The paper "Uncertainty in Neural Networks: Approximately Bayesian Ensembling" by Pearce et al. explores the quantification of uncertainty in neural networks (NNs) through a technique called "anchored ensembling". The technique is proposed as a means of approximate Bayesian inference, addressing the criticism that standard ensembling lacks a direct Bayesian interpretation.

At its core, anchored ensembling regularizes each NN's parameters towards values sampled from a distribution that can be set equal to the prior. This approach is an instance of Randomised MAP Sampling (RMS): each ensemble member is trained to a regularized, MAP-like solution around its own anchor draw, and the resulting ensemble of predictive distributions closely mimics that obtained from a Gaussian Process (GP) in certain NN configurations.
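To make the mechanics concrete, here is a minimal sketch of constructing one anchored ensemble member in PyTorch. The architecture, prior scale, data-noise level, and helper name `make_member` are illustrative assumptions, not the authors' reference implementation; the key ingredient is that each member is pulled towards its own anchor draw from the prior rather than towards zero.

```python
# Minimal sketch of anchored ensembling (Randomised MAP Sampling); the
# architecture, prior scale, and noise level below are illustrative assumptions.
import torch
import torch.nn as nn

def make_member(prior_std=1.0, data_noise=0.1, n_data=100):
    """Build one ensemble member together with its anchored loss."""
    net = nn.Sequential(nn.Linear(1, 50), nn.ReLU(), nn.Linear(50, 1))

    # Anchor: one draw from the prior N(0, prior_std^2) per parameter tensor.
    anchors = [prior_std * torch.randn_like(p) for p in net.parameters()]

    # Regularisation strength: data-noise variance over prior variance,
    # scaled by dataset size, as in the anchored-loss formulation.
    lam = data_noise ** 2 / (prior_std ** 2 * n_data)

    def loss_fn(x, y):
        mse = ((net(x) - y) ** 2).mean()
        reg = sum(((p - a) ** 2).sum()
                  for p, a in zip(net.parameters(), anchors))
        return mse + lam * reg

    return net, loss_fn

# Train several members independently (e.g. with Adam); the spread of their
# predictions provides the uncertainty estimate.
members = [make_member() for _ in range(5)]
```

Setting all anchors to zero would recover ordinary L2-regularized (MAP) training; drawing them afresh from the prior for each member is exactly the modification the paper argues makes the ensemble approximately Bayesian.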

Theoretical Foundation and Novel Contributions

The paper lays out the theoretical underpinning of RMS, in which each member's regularization is anchored at a draw from an anchor distribution. The authors extend the discussion of Bayesian priors by proposing a non-trivial modification to standard ensembling that, they argue, aligns it with Bayesian principles despite the typically non-Bayesian lineage of ensembling methods.

A significant portion of the research explores the mathematical formulation of RMS under the simplifying assumption that both the likelihood and the prior over parameters are multivariate normal. Here, Pearce et al. show that, while exact recovery of the true Bayesian posterior is attainable under specific conditions (perfectly correlated parameters, or extrapolation), the general RMS procedure yields a posterior with a correctly centred mean but underestimated marginal variance and overestimated correlation.
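A rough sketch of that simplified argument, in notation of my own choosing rather than a verbatim restatement of the paper's derivation: with a Gaussian likelihood and prior over parameters, each RMS solution is a linear function of its Gaussian anchor, so the distribution of solutions can be computed in closed form.

```latex
% Sketch: parameter-space likelihood N(theta; mu_like, Sigma_like),
% prior N(theta; mu_prior, Sigma_prior); anchors drawn from the prior.
\begin{align}
\theta_{\mathrm{anc}} &\sim \mathcal{N}(\mu_{\mathrm{prior}}, \Sigma_{\mathrm{prior}}), \\
\theta_{\mathrm{RMS}} &= \arg\max_{\theta}\,
  \mathcal{N}(\theta; \mu_{\mathrm{like}}, \Sigma_{\mathrm{like}})\,
  \mathcal{N}(\theta; \theta_{\mathrm{anc}}, \Sigma_{\mathrm{prior}})
  = \Sigma_{\mathrm{post}}\!\left(\Sigma_{\mathrm{like}}^{-1}\mu_{\mathrm{like}}
    + \Sigma_{\mathrm{prior}}^{-1}\theta_{\mathrm{anc}}\right),
\end{align}
where $\Sigma_{\mathrm{post}} = (\Sigma_{\mathrm{like}}^{-1} + \Sigma_{\mathrm{prior}}^{-1})^{-1}$.
Because $\theta_{\mathrm{RMS}}$ is a linear map of a Gaussian anchor,
\begin{align}
\mathbb{E}[\theta_{\mathrm{RMS}}] = \mu_{\mathrm{post}},
\qquad
\mathrm{Cov}[\theta_{\mathrm{RMS}}]
  = \Sigma_{\mathrm{post}}\,\Sigma_{\mathrm{prior}}^{-1}\,\Sigma_{\mathrm{post}}
  \preceq \Sigma_{\mathrm{post}},
\end{align}
so the ensemble is centred on the true posterior mean but its covariance is shrunk, matching the underestimated variance described above.
```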

Empirical Validation

A comprehensive empirical evaluation spanning regression, image classification, sentiment analysis, and reinforcement learning (RL) tasks shows that anchored ensembling is competitive and has advantages over conventional ensembling techniques. Notably, the method achieves state-of-the-art performance on several benchmark datasets characterized by low data noise.

Figures presented in the paper illustrate that anchored-ensemble predictive distributions closely follow those derived from exact Bayesian methods, with minimal bias in scenarios where the ideal conditions are partially met. This is particularly evident on the UCI regression tasks, where the method provides robust and meaningful uncertainty estimates.
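For reference, the way such a predictive distribution is usually formed from a trained ensemble is sketched below; treating the cross-member spread as epistemic variance and adding back a fixed data-noise term is a common convention and an assumption here, not necessarily the paper's exact recipe.

```python
import torch

def ensemble_predict(members, x, data_noise=0.1):
    """Combine trained anchored-ensemble members into a predictive mean/variance.

    `members` is a list of trained networks; `data_noise` is an assumed
    aleatoric noise standard deviation (illustrative hyperparameter).
    """
    with torch.no_grad():
        preds = torch.stack([net(x) for net in members])  # shape: (M, N, out)
    mean = preds.mean(dim=0)
    var = preds.var(dim=0) + data_noise ** 2  # epistemic spread + assumed noise
    return mean, var
```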

Implications and Future Directions

The implications of adopting anchored ensembling are manifold. Practically, the method offers a scalable, computationally tractable approximation to Bayesian inference in large-scale neural networks. Theoretically, it sharpens the discussion around prior specification and contributes to the ongoing development of Bayesian Neural Networks (BNNs). Furthermore, the analysis of correlations and marginal variances in parameter space motivates further research into optimizing ensembles for more accurate uncertainty characterization.

From a speculative standpoint, the integration of RMS into broader AI systems could enhance robustness, particularly in safety-critical applications and environments characterized by stochasticity and unknown parameters. Future directions might delve into the systematic identification and harnessing of correlated parameter clusters within broader architectures such as Convolutional Neural Networks (CNNs) and further elaborate upon the multimodality of parameter spaces in neural networks.

In summary, the contributions of this paper point towards a nuanced approximation approach that marries ensemble diversity with prior beliefs, advancing the field’s understanding of uncertainty quantification in neural networks. This work stands as a testament to the potential of approximately Bayesian methods in rendering uncertainty estimation both tractable and reliable, effectively paving the way for deeper integration of Bayesian techniques across modern machine learning paradigms.
