Information Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving (1707.02342v1)

Published 7 Jul 2017 in cs.RO

Abstract: We present an information theoretic approach to stochastic optimal control problems that can be used to derive general sampling based optimization schemes. This new mathematical method is used to develop a sampling based model predictive control algorithm. We apply this information theoretic model predictive control (IT-MPC) scheme to the task of aggressive autonomous driving around a dirt test track, and compare its performance to a model predictive control version of the cross-entropy method.

Citations (233)

View on Semantic Scholar

Summary

The paper introduces a novel sampling-based control law that minimizes the KL divergence using free energy principles.
IT-MPC is implemented on a GPU to achieve real-time performance at around 40Hz on a 1:5 scale autonomous driving platform.
The method outperforms cross-entropy MPC by delivering better trajectory adherence and higher success rates under uncertainty.

Essay on "Information Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving"

The paper "Information Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving" by Williams et al. presents an innovative approach to stochastic optimal control, particularly focusing on autonomous driving applications. By leveraging an information theoretic framework, the authors propose a novel sampling-based model predictive control (MPC) algorithm, which they term the Information Theoretic Model Predictive Control (IT-MPC).

Overview and Methodology

The core contribution of this work is the derivation of a sampling-based control law using an information theoretic perspective on optimal control. By considering the free energy and Kullback-Leibler (KL) divergence, the authors develop a theory that aligns the sampling of control trajectories with the optimization of a lower bound on the control cost function. Specifically, the IT-MPC method aims to minimize the KL divergence between a controlled distribution and an optimal distribution derived through the application of free energy principles.

This approach stands out by addressing tractability issues traditionally associated with full state space methods in autonomous driving, which typically demand significant computational power and are intractable for real-time applications. The authors implemented their algorithm on a GPU, massively parallelizing the sampling process, which enables real-time performance at frequencies around 40Hz. The algorithm computes controls by continuously sampling trajectories and iteratively refining them based on their costs, balanced against a noise model inherent to the controls.

Theoretical Contributions

A significant theoretical achievement of this paper is the connection between information theoretic constructs and classical stochastic optimal control. The free energy, a concept borrowed from physics and information theory, is employed to provide a tractable lower bound on the control problem, offering a new lens through which optimal control problems can be analyzed and solved. Furthermore, the authors link their approach with the existing stochastic Hamilton-Jacobi-BeLLMan framework, showcasing that for certain assumptions, these methodologies align to offer similar control outputs.

Numerical Results and Comparison

The IT-MPC algorithm was evaluated on a real-world autonomous driving platform: a 1:5 scale vehicle tasked with aggressive driving on a dirt track. The authors provide extensive results from over 100 kilometers of driving trials, highlighting the robustness and adaptability of their approach. Key performance metrics such as lap completion times, speed profiles, and trajectory variances were recorded, demonstrating the consistent, reliable performance of IT-MPC in contrast with a baseline cross-entropy MPC approach, especially as speed targets increased.

The authors report that IT-MPC maintained high success rates across various target speeds, significantly outperforming the cross-entropy method in terms of success percentage and trajectory adherence. The practical implementation efficiently managed system disturbances and uncertainties within the model, such as unmodeled tire-ground interactions.

Implications and Future Directions

The practical implications of IT-MPC are broad, particularly in applications where non-linear system dynamics complicate the control problem. Its ability to seamlessly integrate hard constraints and operate effectively even when subjected to significant predictive inaccuracies makes it a strong candidate for deployment in real-world autonomous systems.

Theoretically, this work lays the groundwork for future investigation into information theoretic models in diverse control tasks, beyond autonomous driving. The potential to refine the connections between stochastic control theory and information theory could lead to advancements in other fields where uncertainty and dynamic environments interplay critically.

In conclusion, the development of IT-MPC represents a significant advancement in model predictive control approaches by blending principles from information theory to enhance tractability and efficacy in complex and real-time autonomous driving tasks. Future work may explore optimizing the computational aspects and exploring more generalized stochastic models, thereby broadening the application scope of this promising control paradigm.

PDF Markdown