- The paper distinguishes aleatoric uncertainty inherent in data from epistemic uncertainty arising from model limitations.
- It outlines methodologies such as Bayesian inference, deep learning, and conformal prediction to assess and manage uncertainty.
- The study emphasizes robust uncertainty quantification to enhance safety in critical applications like medical diagnosis and autonomous driving.
Aleatoric and Epistemic Uncertainty in Machine Learning: Concepts and Methods
The paper by Hüllermeier and Waegeman highlights the importance of distinguishing between aleatoric and epistemic uncertainty in machine learning. The authors aim to provide a foundational introduction to these concepts, addressing how different sources of uncertainty affect model predictions. Traditionally, uncertainty in machine learning has been handled largely within the probabilistic framework, but as machine learning increasingly permeates safety-critical applications, the need to distinguish and manage different types of uncertainty has become more pronounced.
Introduction
Machine learning often involves creating models that generalize beyond the observed data, inherently dealing with uncertainty. Two primary sources of uncertainty are identified:
- Aleatoric Uncertainty (statistical uncertainty): This refers to inherent randomness or variability in the data generation process, which cannot be reduced even with more data.
- Epistemic Uncertainty (systematic uncertainty): This arises from a lack of knowledge about the best model or the true underlying process. Unlike aleatoric uncertainty, epistemic uncertainty can potentially be reduced with more information.
Sources and Nature of Uncertainty
In supervised learning, the primary goal is to make predictions based on observed data. However, predictions inherently carry uncertainty, which can be attributed to various sources:
- Aleatoric Uncertainty: This is irreducible and stems from the stochastic nature of the data; for instance, the outcome of a fair die roll remains random no matter how much data is collected.
- Epistemic Uncertainty: This can, in principle, be reduced with additional data or knowledge; for example, uncertainty about which model structure best represents the true data-generating process.
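A small simulation makes the distinction concrete. The sketch below (a minimal illustration, not drawn from the paper) estimates the bias of a coin: the posterior over the unknown bias narrows as flips accumulate, so epistemic uncertainty shrinks, while the randomness of each individual flip, the aleatoric part, is unaffected.

```python
import numpy as np

rng = np.random.default_rng(0)
true_bias = 0.7  # unknown to the learner; each flip stays random regardless of sample size

for n in [10, 100, 1000]:
    heads = int((rng.random(n) < true_bias).sum())   # aleatoric: irreducible per-flip randomness
    # Beta(1, 1) prior updated to Beta(1 + heads, 1 + tails): epistemic uncertainty about the bias
    a, b = 1 + heads, 1 + (n - heads)
    post_mean = a / (a + b)
    post_sd = (a * b / ((a + b) ** 2 * (a + b + 1))) ** 0.5
    print(f"n={n:4d}  posterior mean={post_mean:.3f}  posterior sd={post_sd:.3f}")

# The posterior sd (epistemic) shrinks roughly like 1/sqrt(n);
# the per-flip variance true_bias * (1 - true_bias) (aleatoric) does not.
```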
The authors argue that the traditional probabilistic framework in machine learning conflates these types of uncertainties, often leading to inadequately managed risk in applications requiring high reliability, such as medical diagnosis and autonomous driving.
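One way to make the two contributions visible, in the spirit of the ensemble- and Bayesian-averaging approaches the paper surveys, is an entropy decomposition: the total entropy of the averaged prediction splits into the expected entropy of the individual predictions (aleatoric) and the disagreement between them (epistemic). The sketch below uses made-up ensemble outputs purely for illustration.

```python
import numpy as np

def entropy(p, axis=-1):
    p = np.clip(p, 1e-12, 1.0)
    return -(p * np.log(p)).sum(axis=axis)

# Hypothetical class probabilities from three ensemble members for one input (3 classes).
member_probs = np.array([
    [0.70, 0.20, 0.10],
    [0.10, 0.20, 0.70],
    [0.34, 0.33, 0.33],
])

mean_probs = member_probs.mean(axis=0)
total = entropy(mean_probs)               # total uncertainty of the averaged prediction
aleatoric = entropy(member_probs).mean()  # expected entropy: average inherent noise
epistemic = total - aleatoric             # mutual information: disagreement between members

print(f"total={total:.3f}  aleatoric={aleatoric:.3f}  epistemic={epistemic:.3f}")
```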
Methodological Approaches for Handling Uncertainty
Several methodologies are discussed, focusing on distinguishing and handling different types of uncertainties:
- Version Space Learning:
- This method maintains the set of all hypotheses consistent with the observed data; the set shrinks as more data becomes available.
- Because hypotheses are only ruled in or out, the approach captures epistemic uncertainty in a purely set-based way, but offers no graded notion of aleatoric uncertainty.
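The following is a minimal sketch of this idea for a hypothetical one-dimensional threshold classifier (the hypothesis class is an assumption made for illustration): every threshold consistent with the labels seen so far is retained, and the learner returns a set of labels whenever the remaining hypotheses disagree.

```python
import numpy as np

# Hypothesis class: h_t(x) = 1 if x >= t else 0, for thresholds t on a grid.
thresholds = np.linspace(0.0, 1.0, 101)
data = [(0.2, 0), (0.3, 0), (0.8, 1)]      # observed (x, y) pairs

def consistent(t, data):
    return all((x >= t) == bool(y) for x, y in data)

version_space = [t for t in thresholds if consistent(t, data)]

def predict(x):
    labels = {int(x >= t) for t in version_space}
    return labels.pop() if len(labels) == 1 else labels  # unanimous label, or the set {0, 1}

print(predict(0.1))   # all remaining thresholds agree -> 0
print(predict(0.5))   # hypotheses disagree -> {0, 1}: epistemic uncertainty made explicit
```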
- Bayesian Inference:
- Bayesian methods update the hypothesis space by forming a posterior distribution over hypotheses given the data.
- They address both aleatoric and epistemic uncertainties by averaging predictions from plausible hypotheses.
- Techniques such as Gaussian processes make this practical for flexible model classes, providing full posterior predictive distributions from which uncertainty estimates can be read off.
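As a minimal sketch of this idea, the snippet below fits a Gaussian process regressor with scikit-learn (the library choice is an assumption, not prescribed by the paper); the posterior predictive standard deviation grows away from the training data, reflecting epistemic uncertainty, while the white-noise kernel accounts for observation noise.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(0)
X_train = rng.uniform(-3, 3, size=(20, 1))
y_train = np.sin(X_train).ravel() + 0.1 * rng.normal(size=20)   # noisy observations

# The RBF kernel encodes smoothness assumptions; the WhiteKernel models aleatoric noise.
gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
gp.fit(X_train, y_train)

X_test = np.linspace(-6, 6, 5).reshape(-1, 1)
mean, std = gp.predict(X_test, return_std=True)   # posterior predictive mean and std
for x, m, s in zip(X_test.ravel(), mean, std):
    print(f"x={x:+.1f}  mean={m:+.2f}  std={s:.2f}")   # std is larger outside [-3, 3]
```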
- Likelihood-based Methods:
- Fisher information can be used to construct confidence regions around maximum-likelihood estimates, reflecting epistemic uncertainty about the parameter values.
- These techniques, rooted in frequentist principles, offer a robust mathematical framework for uncertainty quantification.
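A minimal sketch of the likelihood-based view, using the standard Bernoulli example rather than anything specific to the paper: the Fisher information yields a Wald confidence interval around the maximum-likelihood estimate, and the interval narrows as the sample grows.

```python
import math

def wald_interval(successes, n, z=1.96):
    """Approximate 95% Wald confidence interval for a Bernoulli parameter."""
    p_hat = successes / n                       # maximum-likelihood estimate
    fisher_info = n / (p_hat * (1 - p_hat))     # Fisher information evaluated at p_hat
    se = 1 / math.sqrt(fisher_info)             # standard error = sqrt of inverse information
    return p_hat - z * se, p_hat + z * se

print(wald_interval(7, 10))      # wide interval: little data, large epistemic uncertainty
print(wald_interval(700, 1000))  # narrow interval: same rate, far more data
```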
- Deep Learning Approaches:
- Novel methods like Bayesian Neural Networks (BNNs) and techniques relying on Monte Carlo Dropout can quantify uncertainty in neural networks.
- These approaches capture epistemic uncertainty by placing (approximate) probability distributions over the network weights instead of committing to a single point estimate.
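As a minimal Monte Carlo Dropout sketch (the PyTorch framework and toy architecture are assumptions made for illustration): dropout is kept active at prediction time, and the spread across repeated stochastic forward passes serves as an estimate of epistemic uncertainty.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# A small regression network with dropout; in practice it would be trained first.
model = nn.Sequential(
    nn.Linear(1, 64), nn.ReLU(), nn.Dropout(p=0.2),
    nn.Linear(64, 64), nn.ReLU(), nn.Dropout(p=0.2),
    nn.Linear(64, 1),
)

def mc_dropout_predict(model, x, n_samples=100):
    model.train()   # keep dropout layers stochastic at prediction time
    with torch.no_grad():
        preds = torch.stack([model(x) for _ in range(n_samples)])
    return preds.mean(dim=0), preds.std(dim=0)   # predictive mean and epistemic spread

mean, std = mc_dropout_predict(model, torch.tensor([[0.5]]))
print(f"mean={mean.item():.3f}  std={std.item():.3f}")
```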
- Set-based and Credal Approaches:
- Credal sets, i.e., sets of plausible probability distributions, generalize single-distribution representations such as the Bayesian posterior and can express epistemic uncertainty that a single prior cannot.
- These methods support robust decision-making in scenarios where a decision must remain justified under every distribution in the set.
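A minimal credal-set sketch based on the imprecise Dirichlet model (the choice of that model and the toy counts are assumptions for illustration): observed class counts yield lower and upper probabilities rather than a single distribution, and a robust decision rule returns a set of classes whenever no single class dominates.

```python
def credal_bounds(counts, s=2.0):
    # Lower/upper class probabilities under the imprecise Dirichlet model with prior weight s.
    n = sum(counts.values())
    lower = {c: k / (n + s) for c, k in counts.items()}
    upper = {c: (k + s) / (n + s) for c, k in counts.items()}
    return lower, upper

def robust_predict(counts, s=2.0):
    lower, upper = credal_bounds(counts, s)
    # Interval dominance: discard a class only if another class's lower bound beats its upper bound.
    return {c for c in counts
            if all(upper[c] >= lower[o] for o in counts if o != c)}

print(robust_predict({"cat": 40, "dog": 5}))  # {'cat'}: clear winner despite imprecision
print(robust_predict({"cat": 3, "dog": 2}))   # {'cat', 'dog'}: too little data to commit
```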
- Conformal Prediction:
- This method constructs prediction regions via a hypothesis-testing argument, with the guarantee that the true outcome is covered with a user-specified probability.
- The central trade-off is between the size of the prediction region and the required confidence level: demanding higher coverage yields larger, less informative regions.
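A minimal split conformal classification sketch (the scoring rule and the made-up calibration probabilities are illustrative assumptions): nonconformity scores on a held-out calibration set determine a quantile threshold, and the prediction set for a new input collects every class whose score stays below it, targeting roughly 1 - alpha coverage.

```python
import numpy as np

alpha = 0.1  # target miscoverage: prediction sets should contain the truth ~90% of the time

# Hypothetical calibration data: model class probabilities and the corresponding true labels.
cal_probs = np.array([[0.80, 0.15, 0.05],
                      [0.50, 0.40, 0.10],
                      [0.20, 0.70, 0.10],
                      [0.60, 0.30, 0.10],
                      [0.10, 0.20, 0.70]])
cal_labels = np.array([0, 1, 1, 0, 2])

# Nonconformity score: one minus the probability assigned to the true class.
scores = 1.0 - cal_probs[np.arange(len(cal_labels)), cal_labels]
n = len(scores)
k = int(np.ceil((n + 1) * (1 - alpha)))       # rank of the finite-sample-corrected quantile
q_hat = np.sort(scores)[min(k, n) - 1]        # clamp for very small calibration sets

# Prediction set for a new input: all classes whose score does not exceed the threshold.
new_probs = np.array([0.45, 0.40, 0.15])
prediction_set = np.where(1.0 - new_probs <= q_hat)[0]
print(prediction_set)   # may contain several classes when the model is unsure
```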
Implications and Future Directions
The implications of adequately handling different types of uncertainties are both practical and theoretical. By better modeling epistemic uncertainties, machine learning models can be made more robust and reliable, especially in critical applications where safety and reliability cannot be compromised.
Future work in AI should focus on refining uncertainty quantification methods and integrating these insights into more widespread machine learning practices. Research should also extend to developing empirically validated measures for evaluating uncertainty management methodologies, ensuring that theoretical advancements translate effectively into practical utility.
In conclusion, the paper underlines the nuanced nature of uncertainty in machine learning, advocating for a more rigorous and differentiated approach to uncertainty management. This, in turn, ensures more reliable and trustworthy AI systems, advancing the field's applicability to an ever-widening array of complex, real-world problems.