Optimal experimental design: Formulations and computations (2407.16212v1)
Abstract: Questions of `how best to acquire data' are essential to modeling and prediction in the natural and social sciences, engineering applications, and beyond. Optimal experimental design (OED) formalizes these questions and creates computational methods to answer them. This article presents a systematic survey of modern OED, from its foundations in classical design theory to current research involving OED for complex models. We begin by reviewing criteria used to formulate an OED problem and thus to encode the goal of performing an experiment. We emphasize the flexibility of the Bayesian and decision-theoretic approach, which encompasses information-based criteria that are well-suited to nonlinear and non-Gaussian statistical models. We then discuss methods for estimating or bounding the values of these design criteria; this endeavor can be quite challenging due to strong nonlinearities, high parameter dimension, large per-sample costs, or settings where the model is implicit. A complementary set of computational issues involves optimization methods used to find a design; we discuss such methods in the discrete (combinatorial) setting of observation selection and in settings where an exact design can be continuously parameterized. Finally we present emerging methods for sequential OED that build non-myopic design policies, rather than explicit designs; these methods naturally adapt to the outcomes of past experiments in proposing new experiments, while seeking coordination among all experiments to be performed. Throughout, we highlight important open questions and challenges.
- Available at arXiv:2211.03952.
- Available at arXiv:2305.03855.
- Available at arXiv:2006.06755.
- Available at arXiv:2207.08670.
- Available at doi:10.1007/s10208-023-09630-x.
- Available at https://openreview.net/forum?id=AY8zfZm0tDd.
- Available at arXiv:2310.16906.
- Available at arXiv:2303.10525.
- Available at arXiv:2404.13056.
- Available at arXiv:2402.16000.
- C. Feng and Y. M. Marzouk (2019), A layered multiple importance sampling scheme for focused optimal Bayesian experimental design. Available at arXiv:1903.11187.
- R. B. Gramacy (2022), plgp: Particle learning of Gaussian processes. Available at https://cran.r-project.org/package=plgp.
- X. Huan and Y. M. Marzouk (2016), Sequential Bayesian optimal experimental design via approximate dynamic programming. Available at arXiv:1604.08320.
- Available at arXiv:2012.05942.
- Available at arXiv:1708.08719.
- S. Kleinegesse and M. U. Gutmann (2021), Gradient-based Bayesian experimental design for implicit models using mutual information lower bounds. Available at arXiv:2105.04379.
- Available at arXiv:2401.07971.
- Available at arXiv:2305.20025.
- Forthcoming.
- To appear in Bernoulli. Available at https://bernoullisociety.org/publications/ bernoulli-journal/bernoulli-journal-papers.
- Available at arXiv:2107.12364.
- Available at arXiv:2402.18337.
- Available at https:// artowen.su.domains/mc/.
- E. Pompe and P. E. Jacob (2021), Asymptotics of cut distributions and robust modular inference using posterior bootstrap. Available at arXiv:2110.11149.
- A.-A. Pooladian and J. Niles-Weed (2021), Entropic estimation of optimal transport maps. Available at arXiv:2109.12004.
- H. Rahimian and S. Mehrotra (2019), Distributionally robust optimization: A review. Available at arXiv:1908.05659.
- J. O. Royset (2022), Risk-adaptive approaches to learning and decision making: A survey. Available at arXiv:2212.00856.
- Available at http://ecommons.cornell.edu/ bitstream/handle/1813/8664/TR000781.pdf?sequence=1.
- W. Shen and X. Huan (2021), Bayesian sequential optimal experimental design for nonlinear models using policy gradient reinforcement learning. Available at arXiv:2110.15335.
- Available at arXiv:2306.10430.
- Available at https://openreview.net/forum?id=B1x62TNtDS.
- Available at arXiv:1807.03748.
- S. Wang and Y. Marzouk (2022), On minimax density estimation via measure transport. Available at arXiv:2207.10231.
- F. Yates (1937), The design and analysis of factorial experiments. Technical Communication no. 35, Imperial Bureau of Soil Science.
- Available at arXiv:2205.13111.
- Available at arXiv:2403.18072.