
Mining gold from implicit models to improve likelihood-free inference (1805.12244v4)

Published 30 May 2018 in stat.ML, cs.LG, hep-ph, and physics.data-an

Abstract: Simulators often provide the best description of real-world phenomena. However, they also lead to challenging inverse problems because the density they implicitly define is often intractable. We present a new suite of simulation-based inference techniques that go beyond the traditional Approximate Bayesian Computation approach, which struggles in a high-dimensional setting, and extend methods that use surrogate models based on neural networks. We show that additional information, such as the joint likelihood ratio and the joint score, can often be extracted from simulators and used to augment the training data for these surrogate models. Finally, we demonstrate that these new techniques are more sample efficient and provide higher-fidelity inference than traditional methods.

Authors (4)
  1. Johann Brehmer (34 papers)
  2. Gilles Louppe (68 papers)
  3. Juan Pavez (7 papers)
  4. Kyle Cranmer (81 papers)
Citations (170)

Summary

Mining Gold from Implicit Models to Improve Likelihood-Free Inference

The paper "Mining Gold from Implicit Models to Improve Likelihood-Free Inference" presents a significant advancement in likelihood-free inference with implicit models defined by simulators. Classical inference approaches struggle in this setting because the density a simulator implicitly defines is intractable to evaluate directly, owing to its high-dimensional latent space. This difficulty is prevalent across diverse fields such as particle physics, epidemiology, and population genetics. Although approximate algorithms such as Approximate Bayesian Computation (ABC) and neural density estimation have been developed to address it, their efficiency is often undermined by the computational cost of simulator runs.

The authors introduce innovative methodologies by leveraging augmented data extracted from the simulators, specifically the joint score and joint likelihood ratio, which are evaluated along the latent trajectories that characterize each simulation run. The core contribution is the derivation of several loss functions that let surrogate models exploit this augmented data, resulting in improved sample efficiency and enhanced inference quality.
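The identity underlying these losses can be illustrated in a toy setting: the conditional expectation of the joint likelihood ratio, averaged over latent trajectories consistent with a given observation, equals the intractable marginal likelihood ratio. A minimal numerical sketch follows; the Gaussian simulator, parameter values, and window-based conditioning are illustrative assumptions, not the paper's experiments:

```python
import numpy as np

rng = np.random.default_rng(0)
theta0, theta1 = 0.5, 0.0
n = 1_000_000

# Toy simulator: latent z ~ N(theta, 1), observable x = z + N(0, 1).
z = rng.normal(theta1, 1.0, n)
x = z + rng.normal(0.0, 1.0, n)

# "Mine" the joint likelihood ratio p(x, z | theta0) / p(x, z | theta1).
# Since p(x | z) does not depend on theta, it reduces to p(z | theta0) / p(z | theta1).
log_r_joint = -0.5 * (z - theta0) ** 2 + 0.5 * (z - theta1) ** 2
r_joint = np.exp(log_r_joint)

# The conditional mean of the joint ratio near an observation x0 approaches
# the intractable marginal ratio p(x0 | theta0) / p(x0 | theta1).
x0, half_width = 1.0, 0.05
mask = np.abs(x - x0) < half_width
estimate = r_joint[mask].mean()

# Exact marginal for this toy model: x | theta ~ N(theta, variance 2).
exact = np.exp(-0.25 * (x0 - theta0) ** 2 + 0.25 * (x0 - theta1) ** 2)
print(estimate, exact)
```

This is the property the paper's loss functions exploit: a network trained by squared-error regression against the mined joint ratios converges, in expectation, to the marginal ratio that no one can compute directly.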

Key Contributions

  1. Augmented Data Utilization: The paper proposes extracting the joint score and joint likelihood ratio from simulators. This data can significantly enhance inference by transforming the implicit likelihood properties into tractable forms, allowing for efficient training of surrogate models through newly defined loss functions.
  2. Novel Loss Functions: Several loss functions are introduced that are minimized by the likelihood or likelihood ratio. These loss functions facilitate the training of neural networks to accurately predict these quantities, thus improving likelihood-free inference methodologies.
  3. Local Model Expansion: The authors propose a local approximation around a reference parameter value, where the score vector acts as a sufficient statistic. This allows for localized optimal summary statistics and avoids the high dimensionality that traditionally hampers inference methods.
  4. Applications and Experiments: The paper includes experimental results across different domains, demonstrating the improved sample efficiency of the new techniques. The examples such as the generalized Galton board, Lotka-Volterra model, and a particle physics simulation underscore the versatility and robustness of the proposed methods.
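The local expansion in point 3 rests on an analogous identity for the joint score: its conditional expectation given an observation equals the marginal score, which serves as the locally sufficient summary statistic. A sketch in the same toy Gaussian model (an illustrative assumption, not one of the paper's benchmarks):

```python
import numpy as np

rng = np.random.default_rng(1)
theta_ref = 0.0
n = 1_000_000

# Same toy simulator: z ~ N(theta, 1), x = z + N(0, 1).
z = rng.normal(theta_ref, 1.0, n)
x = z + rng.normal(0.0, 1.0, n)

# Joint score mined from the simulator:
# d/dtheta log p(x, z | theta) = z - theta (only p(z | theta) depends on theta).
t_joint = z - theta_ref

# Regressing the joint score on x (here, a conditional mean in a narrow window)
# recovers the intractable marginal score d/dtheta log p(x | theta) = (x - theta) / 2,
# the summary statistic that is sufficient in a neighborhood of theta_ref.
x0, half_width = 1.0, 0.05
mask = np.abs(x - x0) < half_width
estimate = t_joint[mask].mean()
exact = (x0 - theta_ref) / 2.0
print(estimate, exact)
```

In practice a neural network plays the role of the window average, mapping high-dimensional observations to this low-dimensional score vector.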

Implications and Future Work

The advancements presented in this paper have profound implications for the future developments in simulation-based inference. By utilizing the latent space information that simulators inherently provide, researchers can achieve more efficient and accurate inference. This approach is particularly promising for areas where direct likelihood calculations are infeasible due to computational limitations.

Future research could explore automating the extraction of augmented data from simulators, as demonstrated in the paper's proof of concept with the Pyro probabilistic programming library. Automating these processes, coupled with refinements in probabilistic programming and machine learning frameworks, would significantly streamline the inference procedure and broaden its applicability to problems across science and engineering.
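The automation amounts to differentiating the simulator's tractable latent log-density with respect to the parameters, which probabilistic programming frameworks can do for free via automatic differentiation. A minimal stand-in below uses a central finite difference instead of autodiff, on a hypothetical latent density chosen for illustration:

```python
import numpy as np

def log_p_latent(z, theta):
    # Tractable latent density of an illustrative simulator: z ~ N(theta, 1).
    return -0.5 * (z - theta) ** 2 - 0.5 * np.log(2.0 * np.pi)

def mine_joint_score(z, theta, eps=1e-5):
    # Central finite difference standing in for the automatic differentiation
    # a framework like Pyro would apply to the traced program.
    return (log_p_latent(z, theta + eps) - log_p_latent(z, theta - eps)) / (2 * eps)

z = np.array([0.3, -1.2, 2.0])
scores = mine_joint_score(z, 0.0)
print(scores)  # analytically, d/dtheta log p = z - theta
```

The same pattern generalizes: whatever conditional densities the program evaluates along its execution trace, their parameter gradients yield the joint score at essentially no extra simulation cost.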

Overall, the paper’s methodologies set the stage for enhanced interaction between deep learning techniques and traditional simulation-based models, heralding advancements in both theoretical understanding and practical applications in the field of likelihood-free inference.