Learning in Inverse Optimization: Incenter Cost, Augmented Suboptimality Loss, and Algorithms (2305.07730v2)
Abstract: In Inverse Optimization (IO), an expert agent solves an optimization problem parametric in an exogenous signal. From a learning perspective, the goal is to learn the expert's cost function given a dataset of signals and corresponding optimal actions. Motivated by the geometry of the IO set of consistent cost vectors, we introduce the "incenter" concept, a new notion akin to circumcenter recently proposed by Besbes et al. (2023). Discussing the geometric and robustness interpretation of the incenter cost vector, we develop corresponding tractable convex reformulations, which are in contrast with the circumcenter, which we show is equivalent to an intractable optimization program. We further propose a novel loss function called Augmented Suboptimality Loss (ASL), a relaxation of the incenter concept for problems with inconsistent data. Exploiting the structure of the ASL, we propose a novel first-order algorithm, which we name Stochastic Approximate Mirror Descent. This algorithm combines stochastic and approximate subgradient evaluations, together with mirror descent update steps, which is provably efficient for the IO problems with discrete feasible sets with high cardinality. We implement the IO approaches developed in this paper as a Python package called InvOpt. Our numerical experiments are reproducible, and the underlying source code is available as examples in the InvOpt package.
- The inverse optimal value problem. Mathematical programming, 102:91–110, 2005.
- Inverse optimization. Operations research, 49(5):771–783, 2001.
- Learning for control: An inverse optimization approach. IEEE Control Systems Letters, 2021.
- Linear coupling: An ultimate unification of gradient and mirror descent. arXiv preprint arXiv:1407.1537, 2014.
- Inverse optimization with noisy data. Operations Research, 66(3):870–892, 2018.
- Emulating the expert: Inverse optimization through online learning. In International Conference on Machine Learning, pages 400–410. PMLR, 2017.
- Dimitri Bertsekas. Convex optimization algorithms. Athena Scientific, 2015.
- Dimitri P. Bertsekas. Nonlinear programming. Athena Scientific, 2008.
- Inverse optimization: A new perspective on the black-litterman model. Operations research, 60(6):1389–1403, 2012.
- Data-driven estimation in equilibrium using inverse optimization. Mathematical Programming, 153(2):595–633, 2015.
- Contextual inverse optimization: Offline and online learning. Operations Research, 2023.
- Inverse mixed integer optimization: Polyhedral insights and trust region methods. INFORMS Journal on Computing, 34(3):1471–1488, 2022.
- Convex optimization. Cambridge university press, 2004.
- S. Bubeck. Convex Optimization: Algorithms and Complexity. Foundations and Trends in Machine Learning. Now Publishers, 2015.
- On an instance of the inverse shortest paths problem. Mathematical programming, 53:45–61, 1992.
- Generalized inverse multiobjective optimization with application to cancer therapy. Operations Research, 62(3):680–695, 2014.
- An inverse optimization approach to measuring clinical pathway concordance. Management Science, 68(3):1882–1903, 2022.
- Inverse optimization: Closed-form solutions, geometry, and goodness of fit. Management Science, 65(3):1115–1135, 2019.
- Inverse optimization: Theory and applications. arXiv preprint arXiv:2109.03920, 2021.
- Online convex optimization perspective for learning from dynamically revealed preferences. arXiv preprint arXiv:2008.10460, 2020.
- Inverse optimization with endogenous arrival time constraints to calibrate the household activity pattern problem. Transportation Research Part B: Methodological, 46(3):463–479, 2012.
- Integer Programming. Graduate Texts in Mathematics. Springer International Publishing, 2014.
- UCI machine learning repository, 2017.
- Generalization bounds in the predict-then-optimize framework. Advances in neural information processing systems, 2019.
- Smart “predict, then optimize”. Management Science, 68(1):9–26, 2022.
- Inverse optimization in high-speed networks. Discrete Applied Mathematics, 129(1):83–98, 2003.
- Robust inverse optimization. Operations Research Letters, 46(3):339–344, 2018.
- Inferring linear feasible regions using inverse optimization. European Journal of Operational Research, 290(3):829–843, 2021.
- Clemens Heuberger. Inverse combinatorial optimization: A survey on problems, methods, and results. Journal of combinatorial optimization, 8:329–361, 2004.
- Integral boundary points of convex polyhedra. 50 Years of Integer Programming 1958-2008: From the Early Years to the State-of-the-Art, pages 49–76, 2010.
- Inverse conic programming with applications. Operations Research Letters, 33(3):319–330, 2005.
- Cutting-plane training of structural SVMs. Machine learning, 77(1):27–59, 2009.
- First order methods for nonsmooth convex large-scale optimization, i: general purpose methods. Optimization for Machine Learning, 30(9):121–148, 2011.
- Imputing a convex objective function. In 2011 IEEE International Symposium on Intelligent Control, pages 613–619, 2011.
- A simpler approach to obtaining an o (1/t) convergence rate for the projected stochastic subgradient method. arXiv preprint arXiv:1212.2002, 2012.
- Ssvm: A smooth support vector machine for classification. Computational optimization and Applications, 20(1):5–22, 2001.
- Relatively smooth convex optimization by first-order methods, and applications. SIAM Journal on Optimization, 28(1):333–354, 2018.
- Data-driven inverse optimization with imperfect information. Mathematical Programming, 167(1):191–234, 2018.
- Jorge J Moré. Generalizations of the trust region problem. Optimization methods and Software, 2(3-4):189–209, 1993.
- Arkadi Nemirovski. Lecture notes: Interior point polynomial methods in convex programming, 1996. URL: https://www2.isye.gatech.edu/~nemirovs/Lect_IPM.pdf. Last visited on 2022/06/23.
- Arkadi Nemirovski. Prox-method with rate of convergence O(1/t) for variational inequalities with lipschitz continuous monotone operators and smooth convex-concave saddle point problems. SIAM Journal on Optimization, 15(1):229–251, 2004.
- Yurii Nesterov. Dual extrapolation and its applications to solving variational inequalities and related problems. Mathematical Programming, 109(2-3):319–344, 2007.
- Structured learning and prediction in computer vision. Foundations and Trends® in Computer Graphics and Vision, 6(3–4):185–365, 2011.
- Francesco Orabona. A modern introduction to online learning. arXiv preprint arXiv:1912.13213, 2019.
- The generalized trust region subproblem. Computational optimization and applications, 58(2):273–322, 2014.
- Maximum margin planning. In Proceedings of the 23rd international conference on Machine learning, pages 729–736, 2006.
- (approximate) subgradient methods for structured prediction. In Artificial Intelligence and Statistics, pages 380–387. PMLR, 2007.
- Short-term forecasting of price-responsive loads using inverse optimization. IEEE Transactions on Smart Grid, 9(5):4805–4814, 2017.
- Sartaj Sahni. Computationally related problems. SIAM Journal on computing, 3(4):262–279, 1974.
- Andrew J Schaefer. Inverse integer programming. Optimization Letters, 3:483–489, 2009.
- Naum Zuselevich Shor. Minimization methods for non-differentiable functions. Springer Series in Computational Mathematics. Springer, 1985.
- An inductive learning approach to prognostic prediction. In Machine Learning Proceedings 1995, pages 522–530. Elsevier, 1995.
- Learning structured prediction models: A large margin approach. In Proceedings of the 22nd international conference on Machine learning, pages 896–903, 2005.
- Structured prediction, dual extragradient and bregman projections. Journal of Machine Learning Research, 7(7), 2006.
- Semi-discrete optimal transport: Hardness, regularization and numerical solution. Mathematical Programming, pages 1–74, 2022.
- Bundle methods for regularized risk minimization. Journal of Machine Learning Research, 11(1), 2010.
- Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1):267–288, 1996.
- Large margin methods for structured and interdependent output variables. Journal of machine learning research, 6(9), 2005.
- The generalized trust region subproblem: solution complexity and convex hull results. Mathematical Programming, 191(2):445–486, 2022.
- Lizhi Wang. Cutting plane algorithms for the inverse mixed integer linear programming problem. Operations research letters, 37(2):114–116, 2009.
- A support vector machine approach to breast cancer diagnosis and prognosis. In IFIP international conference on artificial intelligence applications and innovations, pages 500–507. Springer, 2006.
- Pedro Zattoni Scroccaro. InvOpt: An open-source Python package to solve Inverse Optimization problems. https://github.com/pedroszattoni/invopt, 2023.