Constraint-Informed Learning for Warm Starting Trajectory Optimization (2312.14336v2)
Abstract: Future spacecraft and surface robotic missions require increasingly capable autonomy stacks for exploring challenging and unstructured domains, and trajectory optimization will be a cornerstone of such autonomy stacks. However, the nonlinear optimization solvers required remain too slow for use on relatively resource-constrained flight-grade computers. In this work, we turn towards amortized optimization, a learning-based technique for accelerating optimization run times, and present TOAST: Trajectory Optimization with Merit Function Warm Starts. Offline, using data collected from a simulation, we train a neural network to learn a mapping to the full primal and dual solutions given the problem parameters. Crucially, we build upon recent results from decision-focused learning and present a set of decision-focused loss functions using the notion of merit functions for optimization problems. We show that training networks with such constraint-informed losses can better encode the structure of the trajectory optimization problem and jointly learn to reconstruct the primal-dual solution while yielding improved constraint satisfaction. Through numerical experiments on a Lunar rover problem and a 3-degrees-of-freedom Mars powered descent guidance problem, we demonstrate that TOAST outperforms benchmark approaches in terms of both computation times and network prediction constraint satisfaction.
- B. Amos. Tutorial on amortized optimization. Foundations and Trends in Machine Learning, 16(5):732, 2023.
- CasADi: A software framework for nonlinear optimization and optimal control. Mathematical Programming Computation, 11(1):1–36, 2019.
- J. T. Betts. Survey of numerical methods for trajectory optimization. AIAA Journal of Guidance, Control, and Dynamics, 21(2):193–207, 1998.
- S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge Univ. Press, 2004.
- Improving computational efficiency for powered descent guidance via transformer-based tight constraint prediction. In AIAA Scitech Forum, 2024.
- PRISM: Recurrent neural networks and presolve methods for fast mixed-integer optimal control. In Learning for Dynamics & Control, 2022a.
- CoCo: Online mixed-integer control via supervised learning. IEEE Robotics and Automation Letters, 7(2):1447–1454, 2022b.
- Large scale model predictive control with neural networks and primal active sets. Automatica, 135:109947, 2022.
- MPC on a chip - recent advances on the application of multi-parametric model-based control. Computers & Chemical Engineering, 32(4-5):754–765, 2008.
- Model predictive control in aerospace systems: Current state and opportunities. AIAA Journal of Guidance, Control, and Dynamics, 40(7):1541–1566, 2017.
- On the behavior of Lagrange multipliers in convex and nonconvex infeasible interior point methods. Mathematical Programming, 186:257–288, 2021.
- Deep learning can accelerate grasp-optimized motion planning. Science Robotics, 5(48):1–12, 2020.
- Endurance: Lunar South Pole-Atken Basin traverse and sample return rover. Technical report, National Academy Press, 2022.
- M. Kelly. An introduction to trajectory optimization: How to do your own direct collocation. SIAM Review, 59(4):849 – 904, 2017.
- Adam: A method for stochastic optimization. In Int. Conf. on Learning Representations, 2015.
- End-to-end constrained optimization learning: A survey. In Int. Joint Conf. on Artificial Intelligence, 2021.
- Model predictive trajectory optimization and tracking for on-road autonomous vehicles. In Proc. IEEE Int. Conf. on Intelligent Transportation Systems, 2018.
- Decision-focused learning: Foundations, state of the art, benchmark and future opportunities, 2023. Available at https://arxiv.org/abs/2307.13565.
- Origins, worlds, and life: A decadal strategy for planetary science and astrobiology 2023–2032. Technical report, National Academy Press, 2022.
- J. Nocedal and S. J. Wright. Numerical Optimization. Springer, second edition, 2006.
- Automatic differentiation in PyTorch. In Conf. on Neural Information Processing Systems - Autodiff Workshop, 2017.
- Driving Curiosity: Mars Rover mobility trends during the first seven years. In IEEE Aerospace Conference, 2020.
- Imitation learning from MPC for quadrupedal multi-gait control. In Proc. IEEE Conf. on Robotics and Automation, 2021.
- Machine learning based relative orbit transfer for swarm spacecraft motion planning. In IEEE Aerospace Conference, 2022.
- End-to-end learning to warm-start for real-time quadratic optimization. In Learning for Dynamics & Control, 2023.
- Intriguing properties of neural networks, 2014. Available at https://arxiv.org/abs/1312.6199.
- Demonstration-efficient guided policy search via imitation of robust tube MPC. In Proc. IEEE Conf. on Robotics and Automation, 2022.
- Learning trajectories for real-time optimal control of quadrotors. In IEEE/RSJ Int. Conf. on Intelligent Robots & Systems, 2018.
- Autonomous robotics is driving Perseverance rover’s progress on Mars. Science Robotics, 8(80):1–12, 2023.
- A. Wächter and L. T. Biegler. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Mathematical Programming, 106(1):25–57, 2006.
- Melding the data-decisions pipeline: Decision-focused learning for combinatorial optimization. In Proc. AAAI Conf. on Artificial Intelligence, 2019.
- Safe and near-optimal policy learning for model predictive control using primal-dual neural networks. In American Control Conference, 2019.