Active Learning and Bayesian Optimization: a Unified Perspective to Learn with a Goal (2303.01560v4)
Abstract: Science and Engineering applications are typically associated with expensive optimization problems to identify optimal design solutions and states of the system of interest. Bayesian optimization and active learning compute surrogate models through efficient adaptive sampling schemes to assist and accelerate this search task toward a given optimization goal. Both those methodologies are driven by specific infill/learning criteria which quantify the utility with respect to the set goal of evaluating the objective function for unknown combinations of optimization variables. While the two fields have seen an exponential growth in popularity in the past decades, their dualism and synergy have received relatively little attention to date. This paper discusses and formalizes the synergy between Bayesian optimization and active learning as symbiotic adaptive sampling methodologies driven by common principles. In particular, we demonstrate this unified perspective through the formalization of the analogy between the Bayesian infill criteria and active learning criteria as driving principles of both the goal-driven procedures. To support our original perspective, we propose a general classification of adaptive sampling techniques to highlight similarities and differences between the vast families of adaptive sampling, active learning, and Bayesian optimization. Accordingly, the synergy is demonstrated mapping the Bayesian infill criteria with the active learning criteria, and is formalized for searches informed by both a single information source and multiple levels of fidelity. In addition, we provide guidelines to apply those learning criteria investigating the performance of different Bayesian schemes for a variety of benchmark problems to highlight benefits and limitations over mathematical properties that characterize real-world applications.
- Engineering design optimization. Cambridge University Press, 2021.
- Prognostics and health management of engineering systems. Switzerland: Springer International Publishing, 2017.
- Metamodels for computer-based engineering design: survey and recommendations. Engineering with computers, 17:129–150, 2001.
- Making the most out of surrogate models: tricks of the trade. In International design engineering technical conferences and computers and information in engineering conference, volume 44090, pages 587–598, 2010.
- Design and analysis of computer experiments in multidisciplinary design optimization: a review of how far we have come-or not. In 12th AIAA/ISSMO multidisciplinary analysis and optimization conference, page 5802, 2008.
- Constraint aggregation for large number of constraints in wing surrogate-based optimization. Structural and Multidisciplinary Optimization, 59:421–438, 2019.
- Multi-objective optimization of trimaran sidehull arrangement via surrogate-based approach for reducing resistance and improving the seakeeping performance. Proceedings of the institution of mechanical engineers, part M: journal of engineering for the maritime environment, 235(4):944–956, 2021.
- Ensemble of deep learning models with surrogate-based optimization for medical image segmentation. In 2022 IEEE Congress on Evolutionary Computation (CEC), pages 1–8. IEEE, 2022.
- Testing of autonomous vehicles using surrogate models and stochastic optimization. In 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), pages 1–6. IEEE, 2017.
- Surrogate based optimization of a process of polycrystalline silicon production. Computers & Chemical Engineering, 140:106870, 2020.
- Surrogate-based optimization for variational quantum algorithms. Physical Review A, 107(3):032415, 2023.
- Surrogate-based analysis and optimization. Progress in aerospace sciences, 41(1):1–28, 2005.
- Gary Wang and S. Shan. Review of metamodeling techniques in support of engineering design optimization. Journal of Mechanical Design - J MECH DESIGN, 129, 04 2007.
- Parallel surrogate-assisted global optimization with expensive functions–a survey. Structural and Multidisciplinary Optimization, 54:3–13, 2016.
- A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics, 42(1):55–61, 2000.
- Fractional factorial design. Wiley Interdisciplinary Reviews: Computational Statistics, 1(2):234–244, 2009.
- Douglas C Montgomery. Design and analysis of experiments. John wiley & sons, 2017.
- Conjoint measurement: Methods and applications. Springer Science & Business Media, 2013.
- Response surface methodology: process and product optimization using designed experiments. John Wiley & Sons, 2016.
- Herman Chernoff. Sequential design of experiments. The Annals of Mathematical Statistics, 30(3):755–770, 1959.
- On sequential sampling for global metamodeling in engineering design. In International design engineering technical conferences and computers and information in engineering conference, volume 36223, pages 539–548, 2002.
- Efficient progressive sampling. In Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 23–32, 1999.
- Special section on multidisciplinary design optimization: metamodeling in multidisciplinary design optimization: how far have we really come? AIAA journal, 52(4):670–690, 2014.
- A survey of adaptive sampling for global metamodeling in support of simulation-based complex engineering design. Structural and Multidisciplinary Optimization, 57:393–416, 2018.
- Adaptive sampling approaches for surrogate-based optimization. In Computer Aided Chemical Engineering, volume 47, pages 377–384. Elsevier, 2019.
- Taking the human out of the loop: A review of bayesian optimization. Proceedings of the IEEE, 104(1):148–175, 2015.
- Peter I Frazier. A tutorial on bayesian optimization. arXiv preprint arXiv:1807.02811, 2018.
- Analytical benchmark problems for multifidelity optimization methods. arXiv preprint arXiv:2204.07867, 2022.
- Goal-driven learning. MIT press, 1995.
- Estimation of local modeling error and goal-oriented adaptive modeling of heterogeneous materials: I. error estimates and adaptive algorithms. Journal of Computational Physics, 164(1):22–47, 2000.
- Goal-oriented, model-constrained optimization for reduction of large-scale systems. Journal of Computational Physics, 224(2):880–896, 2007.
- Goal-oriented inference: Approach, linear theory, and application to advection diffusion. siam REVIEW, 55(3):493–519, 2013.
- Non-myopic multifidelity bayesian optimization. arXiv preprint arXiv:2207.06325, 2022.
- Raal: Resource aware active learning for multifidelity efficient optimization. AIAA Journal, 61(6):2744–2753, 2023.
- Pool-based active learning in approximate linear regression. Machine Learning, 75(3):249–274, 2009.
- Dongrui Wu. Pool-based sequential active learning for regression. IEEE transactions on neural networks and learning systems, 30(5):1348–1359, 2018.
- Alexander Shapiro. Monte carlo sampling methods. Handbooks in operations research and management science, 10:353–425, 2003.
- Art B Owen. Quasi-monte carlo sampling. Monte Carlo Ray Tracing: Siggraph, 1:69–88, 2003.
- Adaptive importance sampling. In Structural Safety and Reliability, pages 855–862. ASCE, 1989.
- Adaptive importance sampling: The past, the present, and the future. IEEE Signal Processing Magazine, 34(4):60–79, 2017.
- Multifidelity importance sampling. Computer Methods in Applied Mechanics and Engineering, 300:490–509, 2016.
- On adaptive markov chain monte carlo algorithms. Bernoulli, 11(5):815–828, 2005.
- Adaptive markov chain monte carlo: theory and methods. Bayesian time series models, 1, 2011.
- Adaptive importance sampling in signal processing. Digital Signal Processing, 47:36–49, 2015.
- Abrupt motion tracking via intensively adaptive markov-chain monte carlo sampling. IEEE Transactions on Image Processing, 21(2):789–801, 2011.
- A new reliability method for small failure probability problems by combining the adaptive importance sampling and surrogate models. Computer Methods in Applied Mechanics and Engineering, 372:113336, 2020.
- An adaptive scheme for reliability-based global design optimization: A markov chain monte carlo approach. Mechanical Systems and Signal Processing, 143:106836, 2020.
- Monte Carlo statistical methods, volume 2. Springer, 1999.
- Tim Hesterberg. Weighted average importance sampling and defensive mixture distributions. Technometrics, 37(2):185–194, 1995.
- An adaptive population importance sampler: Learning from uncertainty. IEEE Transactions on Signal Processing, 63(16):4422–4437, 2015.
- Efficient adaptive multiple importance sampling. In 2019 27th European Signal Processing Conference (EUSIPCO), pages 1–5. IEEE, 2019.
- Local and dimension adaptive stochastic collocation for uncertainty quantification. In Sparse grids and applications, pages 181–203. Springer, 2012.
- Adaptive multi-index collocation for uncertainty quantification and sensitivity analysis. International Journal for Numerical Methods in Engineering, 121(6):1314–1343, 2020.
- An adaptive wavelet stochastic collocation method for irregular solutions of partial differential equations with random input data. In Sparse Grids and Applications-Munich 2012, pages 137–170. Springer, 2014.
- On the convergence of adaptive stochastic collocation for elliptic partial differential equations with affine diffusion. SIAM Journal on Numerical Analysis, 60(2):659–687, 2022.
- A stochastic collocation approach to bayesian inference in inverse problems. Communications in Computational Physics, 6(4):826–847, 2009.
- An efficient bayesian inference approach to inverse problems based on an adaptive sparse grid collocation method. Inverse Problems, 25(3):035013, 2009.
- Multi-index stochastic collocation for random pdes. Computer Methods in Applied Mechanics and Engineering, 306:95–122, 2016.
- A fully adaptive multilevel stochastic collocation strategy for solving elliptic pdes with random data. Journal of Computational Physics, 419:109692, 2020.
- Dimension–adaptive tensor–product quadrature. Computing, 71:65–87, 2003.
- A posteriori error estimation for the stochastic collocation finite element method. SIAM Journal on Numerical Analysis, 56(5):3121–3143, 2018.
- A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 403:115671, 2023.
- Deepxde: A deep learning library for solving differential equations. SIAM review, 63(1):208–228, 2021.
- Efficient training of physics-informed neural networks via importance sampling. Computer-Aided Civil and Infrastructure Engineering, 36(8):962–977, 2021.
- Jonas Mockus. Bayesian approach to global optimization: theory and applications, volume 37. Springer Science & Business Media, 2012.
- Burr Settles. Active learning literature survey. 2009.
- A comparative survey: Benchmarking for pool-based active learning. In IJCAI, pages 4679–4686, 2021.
- Harold J Kushner. A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise. 1964.
- AG Zhilinskas. Single-step bayesian search method for an extremum of functions of a single variable. Cybernetics, 11(1):160–166, 1975.
- Jonas Močkus. On bayesian methods for seeking the extremum. In Optimization Techniques IFIP Technical Conference: Novosibirsk, July 1–7, 1974, pages 400–404. Springer, 1975.
- Bruce E Stuckman. A global search method for optimizing nonlinear systems. IEEE Transactions on Systems, Man, and Cybernetics, 18(6):965–977, 1988.
- John F Elder. Global r/sup d/optimization when probes are expensive: the grope algorithm. In [Proceedings] 1992 IEEE International Conference on Systems, Man, and Cybernetics, pages 577–582. IEEE, 1992.
- Efficient global optimization of expensive black-box functions. Journal of Global optimization, 13(4):455–492, 1998.
- Thomas A Zang. Needs and opportunities for uncertainty-based multidisciplinary design methods for aerospace vehicles. National Aeronautics and Space Administration, Langley Research Center, 2002.
- A bayesian approach for shadow extraction from a single image. In Tenth IEEE International Conference on Computer Vision (ICCV’05) Volume 1, volume 1, pages 480–487 Vol. 1, 2005.
- A bayesian optimization algorithm for the nurse scheduling problem. In The 2003 Congress on Evolutionary Computation, 2003. CEC’03., volume 3, pages 2149–2156. IEEE, 2003.
- Bayesian optimization for materials design. Information science for materials discovery and design, pages 45–75, 2016.
- Advances in bayesian optimization with applications in aerospace engineering. In 2018 AIAA Non-Deterministic Approaches Conference, page 1656, 2018.
- Energy management strategy for electric vehicles based on deep q-learning using bayesian optimization. Neural Computing and Applications, 32:14431–14445, 2020.
- Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics. Machine Learning, pages 1–35, 2021.
- Efficient exploration of reward functions in inverse reinforcement learning via bayesian optimization. Advances in Neural Information Processing Systems, 33:4187–4198, 2020.
- Distributed bayesian optimization of deep reinforcement learning algorithms. Journal of Parallel and Distributed Computing, 139:43–52, 2020.
- Financial applications of gaussian processes and bayesian optimization. arXiv preprint arXiv:1903.04841, 2019.
- Cryptocurrency price prediction with neural networks of lstm and bayesian optimization. European Journal of Business and Management Research, 7(2):20–27, 2022.
- Automatic tuning of hyperparameters using bayesian optimization. Evolving Systems, 12:217–223, 2021.
- Bayesian optimization is superior to random search for machine learning hyperparameter tuning: Analysis of the black-box optimization challenge 2020. In NeurIPS 2020 Competition and Demonstration Track, pages 3–26. PMLR, 2021.
- Crowdsourcing interface feature design with bayesian optimization. In Proceedings of the 2019 chi conference on human factors in computing systems, pages 1–12, 2019.
- Sequential gallery for interactive visual design optimization. ACM Transactions on Graphics (TOG), 39(4):88–1, 2020.
- Carl Edward Rasmussen. Gaussian processes in machine learning. In Summer school on machine learning, pages 63–71. Springer, 2003.
- Gaussian processes for global optimization. In 3rd international conference on learning and intelligent optimization (LION3), pages 1–15. Citeseer, 2009.
- Entropy search for information-efficient global optimization. Journal of Machine Learning Research, 13(6), 2012.
- Zi Wang and Stefanie Jegelka. Max-value entropy search for efficient bayesian optimization. In International Conference on Machine Learning, pages 3627–3635. PMLR, 2017.
- The correlated knowledge gradient for simulation optimization of continuous parameters using gaussian process regression. SIAM Journal on Optimization, 21(3):996–1026, 2011.
- Lookahead bayesian optimization with inequality constraints. Advances in Neural Information Processing Systems, 30, 2017.
- Practical two-step lookahead bayesian optimization. Advances in neural information processing systems, 32, 2019.
- Multi-fidelity optimization via surrogate modelling. Proceedings of the royal society a: mathematical, physical and engineering sciences, 463(2088):3251–3269, 2007.
- Survey of multifidelity methods in uncertainty propagation, inference, and optimization. Siam Review, 60(3):550–591, 2018.
- Comparison of multi-fidelity approaches for military vehicle design. In AIAA AVIATION 2020 FORUM, page 3158, 2020.
- Issues in deciding whether to use multifidelity surrogates. Aiaa Journal, 57(5):2039–2054, 2019.
- Efficient surrogate model development: impact of sample size and underlying model dimensions. In Computer Aided Chemical Engineering, volume 44, pages 979–984. Elsevier, 2018.
- Multi-fidelity optimization of super-cavitating hydrofoils. Computer Methods in Applied Mechanics and Engineering, 332:63–85, 2018.
- Multifidelity do-main-aware learning for the design of re-entry vehicles. Structural and Multidisciplinary Optimization, 64(5):3017–3035, 2021.
- Nm-mf: Non-myopic multifidelity framework for constrained multi-regime aerodynamic optimization. AIAA Journal, 61(3):1270–1280, 2023.
- Practical multi-fidelity bayesian optimization for hyperparameter tuning. In Uncertainty in Artificial Intelligence, pages 788–798. PMLR, 2020.
- Multi-fidelity bayesian optimisation with continuous approximations. In International Conference on Machine Learning, pages 1799–1808. PMLR, 2017.
- Scalable inverse reinforcement learning through multifidelity bayesian optimization. IEEE transactions on neural networks and learning systems, 2021.
- Multi-objective and multi-fidelity bayesian optimization of laser-plasma acceleration. Physical Review Research, 5(1):013063, 2023.
- Multi-fidelity bayesian optimization to solve the inverse stefan problem. Computer Methods in Applied Mechanics and Engineering, 410:115946, 2023.
- Model inversion via multi-fidelity bayesian optimization: a new paradigm for parameter estimation in haemodynamics, and beyond. Journal of The Royal Society Interface, 13(118):20151107, 2016.
- Learning cardiac activation maps from 12-lead ecg with multi-fidelity bayesian optimization on manifolds. arXiv preprint arXiv:2203.06222, 2022.
- Predicting the output from a complex computer code when fast approximations are available. Biometrika, 87(1):1–13, 2000.
- Variable-fidelity probability of improvement method for efficient global optimization of expensive black-box problems. Structural and Multidisciplinary Optimization, 62(6):3021–3052, 2020.
- Sequential kriging optimization using multiple-fidelity evaluations. Structural and Multidisciplinary Optimization, 32(5):369–382, 2006.
- Information-based multi-fidelity bayesian optimization. In NIPS Workshop on Bayesian Optimization, 2017.
- Multi-fidelity bayesian optimization with max-value entropy search and its parallelization. In International Conference on Machine Learning, pages 9334–9345. PMLR, 2020.
- Naoki Abe. Query learning strategies using boosting and bagging. Proc. of 15^< th> Int. Cmf. on Machine Learning (ICML98), pages 1–9, 1998.
- Active learning for regression based on query by committee. In International conference on intelligent data engineering and automated learning, pages 209–218. Springer, 2007.
- Batch mode active learning for regression with expected model change. IEEE transactions on neural networks and learning systems, 28(7):1668–1681, 2016.
- A multiple criteria active learning method for support vector regression. Pattern recognition, 47(7):2558–2567, 2014.
- Minimisation of data collection by active learning. In Proceedings of ICNN’95-International Conference on Neural Networks, volume 3, pages 1338–1341. IEEE, 1995.
- An analysis of active learning strategies for sequence labeling tasks. In proceedings of the 2008 conference on empirical methods in natural language processing, pages 1070–1079, 2008.
- Query by committee. In Proceedings of the fifth annual workshop on Computational learning theory, pages 287–294, 1992.
- An active learning approach with uncertainty, representativeness, and diversity. The Scientific World Journal, 2014, 2014.
- Multi-criteria-based active learning for named entity recognition. In Proceedings of the 42nd annual meeting of the Association for Computational Linguistics (ACL-04), pages 589–596, 2004.
- Offline eeg-based driver drowsiness estimation using enhanced batch-mode active learning (ebmal) for regression. In 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pages 000730–000736. IEEE, 2016.
- Robert Munro Monarch. Human-in-the-Loop Machine Learning: Active learning and annotation for human-centered AI. Simon and Schuster, 2021.
- Heterogeneous uncertainty sampling for supervised learning. In Machine learning proceedings 1994, pages 148–156. Elsevier, 1994.
- Margin based active learning. In International Conference on Computational Learning Theory, pages 35–50. Springer, 2007.
- Confidence-based active learning. IEEE transactions on pattern analysis and machine intelligence, 28(8):1251–1261, 2006.
- Entropy-based active learning for object recognition. In 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pages 1–8. IEEE, 2008.
- David Cohn. Neural network exploration using optimal experiment design. Advances in neural information processing systems, 6, 1993.
- Maximizing expected model change for active learning in regression. In 2013 IEEE 13th international conference on data mining, pages 51–60. IEEE, 2013.
- Research on query-by-committee method of active learning and application. In International Conference on Advanced Data Mining and Applications, pages 985–991. Springer, 2006.
- Hierarchical sampling for active learning. In Proceedings of the 25th international conference on Machine learning, pages 208–215, 2008.
- A geometric approach to active learning for convolutional neural networks. arXiv preprint arXiv:1708.00489, 7, 2017.
- Batch mode active sampling based on marginal probability distribution matching. ACM Transactions on Knowledge Discovery from Data (TKDD), 7(3):1–25, 2013.
- Active learning without knowing individual instance labels: a pairwise label homogeneity query approach. IEEE Transactions on Knowledge and Data Engineering, 26(4):808–822, 2013.
- Transductive active learning–a new semi-supervised learning approach based on iteratively refined generative models to capture structure in data. Information Sciences, 293:275–298, 2015.
- A novel active learning framework for classification: Using weighted rank aggregation to achieve multiple query criteria. Pattern Recognition, 93:581–602, 2019.
- Active learning by learning. In Twenty-Ninth AAAI conference on artificial intelligence, 2015.
- Adaptive active learning for image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 859–866, 2013.
- Querying discriminative and representative samples for batch mode active learning. ACM Transactions on Knowledge Discovery from Data (TKDD), 9(3):1–23, 2015.
- Self-paced active learning: Query the right thing at the right time. In Proceedings of the AAAI conference on artificial intelligence, volume 33, pages 5117–5124, 2019.
- J Mockus. On the bayes methods for seeking the extremal point. IFAC Proceedings Volumes, 8(1):428–431, 1975.
- Donald R Jones. A taxonomy of global optimization methods based on response surfaces. Journal of global optimization, 21(4):345–383, 2001.
- Rectified max-value entropy search for bayesian optimization. arXiv preprint arXiv:2202.13597, 2022.
- Incremental relabeling for active learning with noisy crowdsourced annotations. In 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, pages 728–733. IEEE, 2011.
- Efficiently learning the accuracy of labeling sources for selective sampling. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 259–268, 2009.
- Repeated labeling using multiple noisy labelers. Data Mining and Knowledge Discovery, 28(2):402–441, 2014.
- Active learning from crowds. In ICML, 2011.
- Active learning from multiple knowledge sources. In Artificial Intelligence and Statistics, pages 1350–1357. PMLR, 2012.
- Active learning for crowdsourcing using knowledge transfer. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 28, 2014.
- Cost-effective active learning from diverse labelers. In IJCAI, pages 1879–1885, 2017.
- Cmal: Cost-effective multi-label active learning by querying subexamples. IEEE Transactions on Knowledge and Data Engineering, 2020.
- Cost-accuracy aware adaptive labeling for active learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 2569–2576, 2020.
- Sequential optimization using multi-level cokriging and extended expected improvement criterion. Structural and Multidisciplinary Optimization, 58(3):1155–1173, 2018.
- Michael D McKay. Latin hypercube sampling as a tool in uncertainty analysis of computer models. In Proceedings of the 24th conference on Winter simulation, pages 557–564, 1992.
- Engineering design via surrogate modelling: a practical guide. John Wiley & Sons, 2008.
- Parameter selection in synchronous and asynchronous deterministic particle swarm optimization for ship hydrodynamics problems. Applied Soft Computing, 49:313–334, 2016.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.