Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 183 tok/s
Gemini 2.5 Pro 49 tok/s Pro
GPT-5 Medium 29 tok/s Pro
GPT-5 High 24 tok/s Pro
GPT-4o 97 tok/s Pro
Kimi K2 221 tok/s Pro
GPT OSS 120B 440 tok/s Pro
Claude Sonnet 4.5 38 tok/s Pro
2000 character limit reached

LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture (2505.04980v1)

Published 8 May 2025 in cs.RO, cs.SY, and eess.SY

Abstract: This paper proposes a novel Large Vision-LLM (LVLM) and Model Predictive Control (MPC) integration framework that delivers both task scalability and safety for Autonomous Driving (AD). LVLMs excel at high-level task planning across diverse driving scenarios. However, since these foundation models are not specifically designed for driving and their reasoning is not consistent with the feasibility of low-level motion planning, concerns remain regarding safety and smooth task switching. This paper integrates LVLMs with MPC Builder, which automatically generates MPCs on demand, based on symbolic task commands generated by the LVLM, while ensuring optimality and safety. The generated MPCs can strongly assist the execution or rejection of LVLM-driven task switching by providing feedback on the feasibility of the given tasks and generating task-switching-aware MPCs. Our approach provides a safe, flexible, and adaptable control framework, bridging the gap between cutting-edge foundation models and reliable vehicle operation. We demonstrate the effectiveness of our approach through a simulation experiment, showing that our system can safely and effectively handle highway driving while maintaining the flexibility and adaptability of LVLMs.

Summary

LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture

The paper "LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture" presents an innovative framework for autonomous driving applications, integrating a Large Vision-LLM (LVLM) with Model Predictive Control (MPC). This integration aims to ensure both task scalability and safety in autonomous driving environments.

Overview of the Hybrid Architecture

The proposed framework leverages LVLMs to handle high-level task planning across diverse driving scenarios. LVLMs provide symbolic task commands that are processed by MPC Builder, which automatically generates the appropriate MPCs. This hybrid architecture is designed to address two critical challenges in autonomous driving: scalability of task planning and safety assurance during execution.

Scalability and Safety Assurance

Scalability: Traditional model-based planners like MPC often struggle with scalability when combined with the extensive task space of modern LVLMs. The paper suggests that hand-crafting controllers for a limited set of tasks restricts versatility. Hence, the integration of MPC Builder with LVLMs facilitates rapid adaptation to various tasks by synthesizing MPCs from a compact library of primitive design elements.

Safety Assurance: Existing frameworks typically follow a unidirectional pipeline where high-level commands succeed or fail silently without dynamic feedback. The proposed bidirectional communication between LVLMs and MPC ensures that infeasibility is handled effectively through automatic rejection and constructive replanning. The notion of intermediate Optimal Control Problems (iOCP) is introduced to ensure smooth task transitions, mitigating deadlocks and wasted computation.

Simulation and Results

The effectiveness of the LVLM-MPC collaboration is demonstrated through highway driving simulations. The results indicate superior safety and efficiency compared to baseline methods. LVLM-MPCBuilder consistently maintained a 100% success rate in simulations, highlighting its capability to navigate congested traffic safely while accommodating flexible and adaptable task execution.

Implications and Future Work

The integration of LVLMs and MPC presents significant implications for autonomous vehicle design. The framework efficiently bridges the gap between high-level semantic reasoning provided by foundation models and precise, reliable vehicle operation afforded by MPC. Future advancements may deliver more complex scenario handling and real-world deployment. Additionally, expanding this approach to encompass more diverse environmental conditions and task requirements poses intriguing avenues for further research.

In conclusion, this paper articulates a compelling case for the synergistic use of LVLMs and MPC in autonomous driving. The proposed architecture not only meets the demands of scalability and safety but also provides a robust platform for exploring next-generation autonomous vehicle capabilities.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 1 tweet and received 7 likes.

Upgrade to Pro to view all of the tweets about this paper: