Papers
Topics
Authors
Recent
Search
2000 character limit reached

Low Frequency Sampling in Model Predictive Path Integral Control

Published 3 Apr 2024 in cs.RO and math.OC | (2404.03094v2)

Abstract: Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which can result in smoother and more exploratory samples. We use this frequency-based sampling distribution with Model Predictive Path Integral (MPPI) in both hardware and simulation experiments to show better or equal performance on systems with various speeds of input response.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)
  1. P. T. Boggs and J. W. Tolle, “Sequential quadratic programming,” Acta numerica, vol. 4, pp. 1–51, 1995.
  2. R. Rubinstein, “The cross-entropy method for combinatorial and continuous optimization,” Methodology and computing in applied probability, vol. 1, pp. 127–190, 1999. [Online]. Available: https://link.springer.com/article/10.1023/A:1010091220143
  3. G. Williams, P. Drews, B. Goldfain, J. M. Rehg, and E. A. Theodorou, “Aggressive Driving with Model Predictive Path Integral Control,” in 2016 IEEE International Conference on Robotics and Automation (ICRA).   IEEE, 2016, pp. 1433–1440. [Online]. Available: https://ieeexplore.ieee.org/document/7487277/
  4. ——, “Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving,” IEEE Transactions on Robotics, vol. 34, no. 6, pp. 1603–1622, 2018. [Online]. Available: https://ieeexplore.ieee.org/document/8558663/
  5. Z. Wang, O. So, K. Lee, and E. A. Theodorou, “Adaptive Risk Sensitive Model Predictive Control with Stochastic Search,” in Proceedings of the 3rd Conference on Learning for Dynamics and Control.   PMLR, 2021, pp. 510–522. [Online]. Available: https://proceedings.mlr.press/v144/wang21b.html
  6. J. Watson and J. Peters, “Inferring smooth control: Monte carlo posterior policy iteration with gaussian processes,” in Conference on Robot Learning.   PMLR, 2023, pp. 67–79. [Online]. Available: https://proceedings.mlr.press/v205/watson23a/watson23a.pdf
  7. I. S. Mohamed, K. Yin, and L. Liu, “Autonomous Navigation of AGVs in Unknown Cluttered Environments: Log-MPPI Control Strategy,” IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 10 240–10 247, 2022. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/9834098
  8. T. Kim, G. Park, K. Kwak, J. Bae, and W. Lee, “Smooth Model Predictive Path Integral Control Without Smoothing,” IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 10 406–10 413, 2021. [Online]. Available: https://ieeexplore.ieee.org/document/9835021
  9. Z. Wang, O. So, J. Gibson, B. Vlahov, M. S. Gandhi, G.-H. Liu, and E. A. Theodorou, “Variational Inference MPC using Tsallis Divergence,” in Robotics: Science and Systems XVII.   Robotics: Science and Systems Foundation, Apr. 2021. [Online]. Available: https://www.roboticsproceedings.org/rss17/p073.html
  10. A. Lambert, F. Ramos, B. Boots, D. Fox, and A. Fishman, “Stein Variational Model Predictive Control,” in Proceedings of the 2020 Conference on Robot Learning.   PMLR, 2021, pp. 1278–1297. [Online]. Available: https://proceedings.mlr.press/v155/lambert21a.html
  11. J. Sacks and B. Boots, “Learning Sampling Distributions for Model Predictive Control,” in Proceedings of The 6th Conference on Robot Learning.   PMLR, Mar. 2023, pp. 1733–1742. [Online]. Available: https://proceedings.mlr.press/v205/sacks23a.html
  12. T. Power and D. Berenson, “Variational Inference MPC using Normalizing Flows and Out-of-Distribution Projection,” in Robotics: Science and Systems XVIII.   Robotics: Science and Systems Foundation, Jun. 2022. [Online]. Available: http://www.roboticsproceedings.org/rss18/p027.pdf
  13. C. Pinneri, J. Achterhold, M. Rolínek, and G. Martius, “Sample-efficient Cross-Entropy Method for Real-time Planning,” in Proceedings of the 2020 Conference on Robot Learning.   PMLR, 2021, pp. 1049–1065. [Online]. Available: https://proceedings.mlr.press/v155/pinneri21a.html
  14. Z. Zhang, J. Chen, Z. Chen, and W. Li, “Asynchronous Episodic Deep Deterministic Policy Gradient: Toward Continuous Control in Computationally Complex Environments,” IEEE Transactions on Cybernetics, vol. 51, no. 2, pp. 604–613, 2021. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/8946888
  15. J. Timmer and M. König, “On generating power law noise,” Astronomy & Astrophysics, vol. 300, pp. 707–710, 1995. [Online]. Available: http://jeti.uni-freiburg.de/papers/timmer95.pdf
  16. R. Bracewell, “The Fourier Transform and its Applications.”   McGraw-Hill, 2000, ch. 2.
  17. E. Zhou and J. Hu, “Gradient-Based Adaptive Stochastic Search for Non-Differentiable Optimization,” IEEE Transactions on Automatic Control, vol. 59, no. 7, pp. 1818–1832, 2014. [Online]. Available: https://ieeexplore.ieee.org/document/6756948
  18. J. Gibson, B. Vlahov, D. Fan, P. Spieler, D. Pastor, A.-a. Agha-mohammadi, and E. A. Theodorou, “A Multi-step Dynamics Modeling Framework For Autonomous Driving In Multiple Environments,” in 2023 IEEE International Conference on Robotics and Automation (ICRA).   IEEE, May 2023, pp. 7959–7965. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/10161330
  19. Y. Song, S. Naji, E. Kaufmann, A. Loquercio, and D. Scaramuzza, “Flightmare: A Flexible Quadrotor Simulator,” in Proceedings of the 2020 Conference on Robot Learning.   PMLR, 2021, pp. 1147–1157. [Online]. Available: https://proceedings.mlr.press/v155/song21a.html
Citations (4)

Summary

  • The paper presents a novel colored noise sampling approach in MPPI control to reduce high-frequency noise and achieve smoother trajectories.
  • It implements frequency-domain sampling using power spectral density and iFFT techniques, refining the optimization process with minimal adjustments.
  • Experimental validations on off-road vehicles and quadrotors demonstrate improved maneuverability, reduced chatter, and faster system stabilization.

Overview of "Low Frequency Sampling in Model Predictive Path Integral Control"

The paper presents advancements in sampling methods for Model Predictive Path Integral (MPPI) control by introducing a colored noise distribution for sampling, in contrast to the traditional uncorrelated Gaussian distribution. The primary objective of this research is to improve the control inputs generated by MPPI, facilitating smoother trajectories and enhanced exploration through a focus on low-frequency control signals. This paper details both simulation and hardware experiments, demonstrating the advantages of colored noise sampling in achieving better performance and smoother trajectories across various scenarios.

Introduction to Sampling-based Model Predictive Control

Sampling-based methods for model-predictive control (MPC) have become essential in navigating complex environments. MPPI, a stochastic optimal control method, has gained prominence due to its applicability to non-linear dynamics without stringent differentiability requirements. However, conventional Gaussian sampling methods have limitations, particularly producing control inputs with high-frequency noise, which can lead to chattering and degradation in system performance over time.

The paper hypothesizes that using colored noise, which emphasizes low-frequency components, can mitigate these issues by generating smoother control signals. Such signals are more akin to human-input trajectories and beneficial for systems with slow input-response characteristics. Figure 1

Figure 1

Figure 1: The off-road vehicle in a desert terrain just before an autonomy test and a screenshot of the Flightmare quadrotor simulator.

Mathematical Formulation and Implementation

The paper introduces colored noise sampling by leveraging power spectral density (PSDPSD) concepts, where noise distributions are crafted with PSD(f)1fγPSD(f) \propto \frac{1}{f^\gamma}. This prioritization of low-frequency components is achieved by sampling Gaussian components in the frequency domain and utilizing an inverse Fast Fourier Transform (iFFT) to convert them into time-domain control signals.

This approach retains the theoretical underpinnings of MPPI, requiring only minor adjustments to the optimization process, namely in computing the optimal control trajectory using weighted averages of sampled trajectories. The proposed frequency-based sampling method is seamlessly integrated into the MPPI framework with adjusted update rules and control law derivations. Figure 2

Figure 2

Figure 2: Samples of state trajectories generated from frequency-based and Gaussian control inputs highlighting smoother and exploratory results with the former.

Experimental Validation

The research involves comprehensive experiments, including:

Off-road Vehicle Platform

Testing on an off-road vehicle highlights the limitations of Gaussian sampling where quick, successive turns are necessary. The colored noise method outperformed Gaussian sampling, enabling the vehicle to negotiate a zig-zag corridor with significantly fewer manual interventions and smoother control trajectories. Figure 3

Figure 3

Figure 3: Hardware experiments showing the vehicle's maneuverability in a zig-zag corridor using different sampling methods.

Simulated Quadrotor Environment

In a simulated quadrotor environment using the Flightmare simulator, colored noise sampling achieved fast lap times with reduced control signal chatter compared to Gaussian methods. This is critical in real-world applications to prevent hardware fatigue and failure. Figure 4

Figure 4

Figure 4

Figure 4

Figure 4: Quadrotor flight trajectories comparing Gaussian and Colored sampling, highlighting smoother control inputs with the latter.

Double Integrator Tests

Finally, tests on a double integrator system reveal that colored noise sampling allows for more rapid system stabilization, as it generates control impulses effectively tailored to system dynamics. The method shows robustness across different system configurations. Figure 5

Figure 5

Figure 5

Figure 5: State and control trajectories with energy power spectral density for quick system stabilization.

Conclusion

This work introduces a novel sampling distribution for MPPI, showcasing its advantages in producing smoother and more efficient control inputs. By emphasizing low-frequency control inputs, the colored noise method enhances exploration capabilities while reducing control signal chatter. These improvements are validated through rigorous experiments involving real and simulated environments, suggesting that frequency-based sampling provides a general framework adaptable to various control scenarios. Future work could explore integrating this method with other advanced MPPI techniques and extending its application to more complex, real-world environments.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.