iMotion-LLM: Motion Prediction Instruction Tuning (2406.06211v2)

Published 10 Jun 2024 in cs.CV

Abstract: We introduce iMotion-LLM, a multimodal LLM with trajectory prediction, tailored to guide interactive multi-agent scenarios. Unlike conventional motion prediction approaches, iMotion-LLM uses textual instructions as key inputs for generating contextually relevant trajectories. By enriching the real-world driving scenarios in the Waymo Open Dataset with textual motion instructions, we created InstructWaymo. Leveraging this dataset, iMotion-LLM integrates a pretrained LLM, fine-tuned with LoRA, to translate scene features into the LLM input space. iMotion-LLM offers significant advantages over conventional motion prediction models. First, it can generate trajectories that align with the provided instruction when the instructed direction is feasible. Second, when given an infeasible direction, it can reject the instruction, thereby enhancing safety. These findings act as milestones in empowering autonomous navigation systems to interpret and predict the dynamics of multi-agent environments, laying the groundwork for future advancements in this field.

Authors (6)
  1. Abdulwahab Felemban (2 papers)
  2. Eslam Mohamed Bakr (8 papers)
  3. Xiaoqian Shen (14 papers)
  4. Jian Ding (132 papers)
  5. Abduallah Mohamed (10 papers)
  6. Mohamed Elhoseiny (102 papers)
Citations (1)
