Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DiffuSolve: Diffusion-based Solver for Non-convex Trajectory Optimization (2403.05571v4)

Published 22 Feb 2024 in cs.RO and cs.LG

Abstract: Optimal trajectory design is computationally expensive for nonlinear and high-dimensional dynamical systems. The challenge arises from the non-convex nature of the optimization problem with multiple local optima, which usually requires a global search. Traditional numerical solvers struggle to find diverse solutions efficiently without appropriate initial guesses. In this paper, we introduce DiffuSolve, a general diffusion model-based solver for non-convex trajectory optimization. An expressive diffusion model is trained on pre-collected locally optimal solutions and efficiently samples initial guesses, which then warm-starts numerical solvers to fine-tune the feasibility and optimality. We also present DiffuSolve+, a novel constrained diffusion model with an additional loss in training that further reduces the problem constraint violations of diffusion samples. Experimental evaluations on three tasks verify the improved robustness, diversity, and a 2$\times$ to 11$\times$ increase in computational efficiency with our proposed method, which generalizes well to trajectory optimization problems of varying challenges.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (39)
  1. Is conditional generative modeling all you need for decision-making? arXiv preprint arXiv:2211.15657, 2022.
  2. Brandon Amos et al. Tutorial on amortized optimization. Foundations and Trends® in Machine Learning, 16(5):592–732, 2023.
  3. Dynamically leveraged automated multibody (n) trajectory optimization. In AAS/AIAA Space Flight Mechanics Conference, Charlotte, NC, 8 2022. American Astronautical Society.
  4. John T Betts. Survey of numerical methods for trajectory optimization. Journal of Guidance, Control, and Dynamics, 21(2):193–207, 1998.
  5. Sequential quadratic programming. Acta numerica, 4:1–51, 1995.
  6. Trajectory generation, control, and safety with denoising diffusion probabilistic models. arXiv preprint arXiv:2306.15512, 2023.
  7. Physics-informed neural networks (pinns) for fluid mechanics: A review. Acta Mechanica Sinica, 37(12):1727–1738, 2021.
  8. Motion planning diffusion: Learning and planning of robot motions with diffusion models. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.  1916–1923. IEEE, 2023.
  9. Coco: Online mixed-integer control via supervised learning. IEEE Robotics and Automation Letters, 7(2):1447–1454, 2021.
  10. Denoising heat-inspired diffusion with insulators for collision free motion planning. arXiv preprint arXiv:2310.12609, 2023.
  11. Large scale model predictive control with neural networks and primal active sets. Automatica, 135:109947, 2022.
  12. Diffusion policy: Visuomotor policy learning via action diffusion. arXiv preprint arXiv:2303.04137, 2023.
  13. Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34:8780–8794, 2021.
  14. Diffusion world model. arXiv preprint arXiv:2402.03570, 2024.
  15. Dc3: A learning method for optimization with hard constraints. arXiv preprint arXiv:2104.12225, 2021.
  16. Erwin Fehlberg. Klassische runge-kutta-formeln fünfter und siebenter ordnung mit schrittweiten-kontrolle. Computing, 4(2):93–106, 1969.
  17. The vendi score: A diversity evaluation metric for machine learning. arXiv preprint arXiv:2210.02410, 2022.
  18. Snopt: An sqp algorithm for large-scale constrained optimization. SIAM review, 47(1):99–131, 2005.
  19. Classifier-free diffusion guidance. arXiv preprint arXiv:2207.12598, 2022.
  20. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
  21. Planning with diffusion for flexible behavior synthesis. arXiv preprint arXiv:2205.09991, 2022.
  22. Amortized global search for efficient preliminary trajectory design with deep generative models. arXiv preprint arXiv:2308.03960, 2023.
  23. Adaptdiffuser: Diffusion models as adaptive self-evolving planners. arXiv preprint arXiv:2302.01877, 2023.
  24. Generative skill chaining: Long-horizon skill planning with diffusion models. In Conference on Robot Learning, pp.  2905–2925. PMLR, 2023.
  25. Constrained stein variational trajectory optimization. arXiv preprint arXiv:2308.12110, 2023.
  26. Sampling constrained trajectories using composable diffusion models. In IROS 2023 Workshop on Differentiable Probabilistic Robotics: Emerging Perspectives on Robot Learning, 2023.
  27. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational physics, 378:686–707, 2019.
  28. End-to-end learning to warm-start for real-time quadratic optimization. In Learning for Dynamics and Control Conference, pp.  220–234. PMLR, 2023.
  29. Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pp.  2256–2265. PMLR, 2015.
  30. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020a.
  31. Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020b.
  32. Nomad: Goal masked diffusion policies for navigation and exploration. arXiv preprint arXiv:2310.07896, 2023.
  33. Human motion diffusion model. arXiv preprint arXiv:2209.14916, 2022.
  34. Diffusion policies as an expressive policy class for offline reinforcement learning. arXiv preprint arXiv:2208.06193, 2022.
  35. Stephen J Wright. Primal-dual interior-point methods. SIAM, 1997.
  36. Safediffuser: Safe planning with diffusion probabilistic models. arXiv preprint arXiv:2306.00148, 2023.
  37. Compositional diffusion-based continuous constraint solvers. arXiv preprint arXiv:2309.00966, 2023.
  38. Real-time suboptimal model predictive control using a combination of explicit mpc and online optimization. IEEE Transactions on Automatic Control, 56(7):1524–1534, 2011.
  39. Safe and near-optimal policy learning for model predictive control using primal-dual neural networks. In 2019 American Control Conference (ACC), pp.  354–359. IEEE, 2019.
Citations (3)

Summary

We haven't generated a summary for this paper yet.