
BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification (2405.17875v1)

Published 28 May 2024 in math.OC and cs.LG

Abstract: This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem. The IO problem is commonly formulated as a large-scale bilevel program that is notoriously difficult to solve. Deviating from traditional exact solution methods, we propose a derivative-free optimization approach based on Bayesian optimization, which we call BO4IO, to solve general IO problems. We treat the IO loss function as a black box and approximate it with a Gaussian process model. Using the predicted posterior function, an acquisition function is minimized at each iteration to query new candidate solutions and sequentially converge to the optimal parameter estimates. The main advantages of using Bayesian optimization for IO are twofold: (i) it circumvents the need for complex reformulations of the bilevel program or specialized algorithms and can hence enable computational tractability even when the underlying optimization problem is nonconvex or involves discrete variables, and (ii) it allows approximations of the profile likelihood, which provide uncertainty quantification on the IO parameter estimates. We apply the proposed method to three computational case studies, covering different classes of forward optimization problems ranging from convex nonlinear to nonconvex mixed-integer nonlinear programs. Our extensive computational results demonstrate the efficacy and robustness of BO4IO in accurately estimating unknown model parameters from small and noisy datasets. In addition, the proposed profile likelihood analysis proves effective in providing good approximations of the confidence intervals on the parameter estimates and in assessing the identifiability of the unknown parameters.
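
The loop described in the abstract (treat the IO loss as a black box, fit a Gaussian process to it, minimize an acquisition function to pick the next parameter to query) can be illustrated with a minimal sketch. This is not the authors' implementation. Everything specific here is an assumption made for illustration: the one-parameter toy forward problem, the squared-error loss, the RBF kernel with a fixed length scale, the lower-confidence-bound acquisition, the grid of candidates, and all function names.

```python
import math
import random

def forward_solve(theta, u):
    """Toy forward problem (assumed): min_x 0.5*x**2 - theta*u*x, 0 <= x <= 1."""
    return min(1.0, max(0.0, theta * u))

def io_loss(theta, data):
    """Black-box IO loss: mean squared gap between observed and predicted decisions."""
    return sum((x - forward_solve(theta, u)) ** 2 for u, x in data) / len(data)

def rbf(a, b, length=0.3):
    """Squared-exponential kernel with a fixed (assumed) length scale."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def solve_lower(L, b):
    """Solve L y = b by forward substitution."""
    y = []
    for i, bi in enumerate(b):
        y.append((bi - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def solve_upper(L, y):
    """Solve L^T x = y by back substitution."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_fit(X, z, noise=1e-4):
    """Factor the kernel matrix once and precompute alpha = K^{-1} z."""
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(X)]
         for i, a in enumerate(X)]
    L = cholesky(K)
    return L, solve_upper(L, solve_lower(L, z))

def gp_predict(X, L, alpha, t):
    """Posterior mean and standard deviation of the zero-mean GP at point t."""
    k = [rbf(a, t) for a in X]
    v = solve_lower(L, k)
    mu = sum(ki * ai for ki, ai in zip(k, alpha))
    var = max(1.0 - sum(vi * vi for vi in v), 0.0)
    return mu, math.sqrt(var)

def bo_inverse_opt(data, lo=0.0, hi=2.0, n_iter=12, beta=2.0):
    """Bayesian optimization of the IO loss over a grid of candidate parameters."""
    grid = [lo + (hi - lo) * i / 200 for i in range(201)]
    X = [grid[0], grid[100], grid[200]]          # small initial design
    y = [io_loss(t, data) for t in X]
    for _ in range(n_iter):
        # Standardize the losses so the unit-variance GP prior is reasonable.
        m = sum(y) / len(y)
        s = math.sqrt(sum((v - m) ** 2 for v in y) / len(y)) or 1.0
        L, alpha = gp_fit(X, [(v - m) / s for v in y])

        def lcb(t):
            mu, sd = gp_predict(X, L, alpha, t)
            return mu - beta * sd                # lower confidence bound

        # Minimize the acquisition over candidates not yet evaluated.
        t_next = min((t for t in grid if t not in X), key=lcb)
        X.append(t_next)
        y.append(io_loss(t_next, data))
    best = min(range(len(y)), key=y.__getitem__)
    return X[best], y[best]

# Synthetic noisy decisions generated with a "true" parameter of 1.3 (assumed).
rng = random.Random(0)
data = []
for _ in range(30):
    u = rng.uniform(0.1, 0.7)
    data.append((u, forward_solve(1.3, u) + rng.gauss(0.0, 0.02)))

theta_hat, loss_hat = bo_inverse_opt(data)
print(theta_hat, loss_hat)
```

Each iteration calls the forward solver only through `io_loss`, so the forward problem could be swapped for a nonconvex or mixed-integer model without changing the outer loop. That is the property the abstract highlights as advantage (i).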

