Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks (1705.10868v1)

Published 30 May 2017 in cs.AI, cs.MA, and cs.RO

Abstract: The multi-agent path-finding (MAPF) problem has recently received a lot of attention. However, it does not capture important characteristics of many real-world domains, such as automated warehouses, where agents are constantly engaged with new tasks. In this paper, we therefore study a lifelong version of the MAPF problem, called the multi-agent pickup and delivery (MAPD) problem. In the MAPD problem, agents have to attend to a stream of delivery tasks in an online setting. One agent has to be assigned to each delivery task. This agent has to first move to a given pickup location and then to a given delivery location while avoiding collisions with other agents. We present two decoupled MAPD algorithms, Token Passing (TP) and Token Passing with Task Swaps (TPTS). Theoretically, we show that they solve all well-formed MAPD instances, a realistic subclass of MAPD instances. Experimentally, we compare them against a centralized strawman MAPD algorithm without this guarantee in a simulated warehouse system. TP can easily be extended to a fully distributed MAPD algorithm and is the best choice when real-time computation is of primary concern since it remains efficient for MAPD instances with hundreds of agents and tasks. TPTS requires limited communication among agents and balances well between TP and the centralized MAPD algorithm.

Citations (225)

View on Semantic Scholar

Summary

The paper introduces novel TP and TPTS algorithms that dynamically solve MAPD tasks in evolving multi-agent environments with decentralized planning.
It demonstrates that the TP algorithm scales efficiently for large numbers of agents while TPTS reduces service times through dynamic task swapping.
Comparative evaluations reveal that centralized approaches deliver optimal paths in low-congestion scenarios, emphasizing trade-offs between computation and performance.

Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks

The multi-agent pickup and delivery (MAPD) problem extends the traditional multi-agent path-finding (MAPF) problem to better reflect real-world scenarios where agents continuously address new tasks in an environment, such as automated warehouses or autonomous vehicle coordination. This paper introduces MAPD, presents two novel algorithms—Token Passing (TP) and Token Passing with Task Swaps (TPTS)—and evaluates their performance against a centralized approach, noted as CENTRAL, in simulated systems.

Problem Setting and Challenges

The MAPD problem involves agents navigating a shared environment modeled as an undirected graph, continuously assigned to new dynamically arriving tasks. Each task involves moving from a pickup location to a delivery location while coordinating to avoid collisions. Unlike MAPF, the agents must manage an evolving set of tasks, and the assignments and paths have to be dynamically recalculated.

Algorithms for Solving MAPD

Token Passing (TP):
- TP decentralizes the MAPD task by allowing each agent to request a token to plan its path and task assignments independently.
- The algorithm operates by updating paths in a shared memory, which each agent modifies sequentially, ensuring that paths are collision-free.
- A significant feature of TP is its role as a fully-distributed MAPD algorithm when real-time decision-making is prioritized. It achieves scalability, handling instances with many agents efficiently, albeit potentially with lower optimality in solution quality compared to centralized methods.
Token Passing with Task Swaps (TPTS):
- TPTS builds on TP's framework and introduces a mechanism for task swapping, where agents can reassign tasks if another can perform it more efficiently.
- The algorithm ensures that tasks are completed more quickly by dynamically re-evaluating all task-agent assignments upon each token possession.
- TPTS achieves a balanced performance, surpassing TP in service time reduction due to its flexible task assignment process while maintaining lower computational overheads than CENTRAL.
CENTRAL Algorithm:
- CENTRAL operates on a centralized approach, considering all agents and tasks collectively to map the optimal path from each current location to intended destinations using Conflict-Based Search, an optimal MAPF algorithm.
- Though computationally intensive—limiting it in large-scale, real-time operations—CENTRAL demonstrates superior service times due to its holistic environment coordination, particularly in less congested systems.

Implications and Future Work

The paper's algorithms are tested in various simulated environments resembling realistic warehouse layouts, showcasing that both TP and TPTS extend to large numbers of agents with differing efficiencies and service times. CENTRAL offers highly optimized coordination under low computational constraints, making it suitable for smaller, less time-sensitive environments. For future development, the paper suggests exploring further decentralization of TPTS and improving communication strategies among agents to balance computation with optimal task execution further.

Given the ongoing development of automated multi-agent systems, such as warehouse robots and autonomous vehicles, the practical implications of the algorithms presented within this paper are significant. By effectively managing path finding in dynamic and uncertain environments, such approaches facilitate advanced planning necessary for the further integration of automation in industrial and urban settings. The analysis suggests potential paths for future research, particularly involving hybrid models that adapt dynamically between decentralized and centralized methodologies.

In conclusion, the investigation into MAPD presents a significant leap in realistic multi-agent coordination systems, offering valuable insights and effective algorithms that balance computational efficiency with task execution, catering to various real-world automation applications.

PDF Markdown