Learning to Deliver: a Foundation Model for the Montreal Capacitated Vehicle Routing Problem (2403.00026v1)
Abstract: In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio et al. (2021), is defined on a fixed and finite graph, which is analogous to a city. Each MCVRP instance is essentially the sub-graph connecting a randomly sampled subset of the nodes in the fixed graph, which represent a set of potential addresses in a real-world delivery problem on a given day. Our work exploits this problem structure to frame the MCVRP as an analogous NLP task. Specifically, we leverage a Transformer architecture embedded in a LLM framework to train our model in a supervised manner on computationally inexpensive, sub-optimal MCVRP solutions obtained algorithmically. Through comprehensive computational experiments, we show that FM-MCVRP produces better MCVRP solutions than the training data and generalizes to larger sized problem instances not seen during training. Even when compared to near-optimal solutions from state-of-the-art heuristics, FM-MCVRP yields competitive results despite being trained on inferior data. For instance, for 400-customer problems, FM-MCVRP solutions on average fall within 2% of the benchmark. Our results further demonstrate that unlike prior works in the literature, FM-MCVRP is a unified model, which performs consistently and reliably on a range of problem instance sizes and parameter values such as the vehicle capacity.
- Bertsimas D, Tsitsiklis J (1993) Simulated annealing. Statistical science 8(1):10–15.
- Croes GA (1958) A method for solving traveling-salesman problems. Operations research 6(6):791–812.
- Helsgaun K (2017) An extension of the Lin-Kernighan-Helsgaun TSP solver for constrained traveling salesman and vehicle routing problems. Roskilde: Roskilde University 24–50.
- Hopfield JJ, Tank DW (1985) “Neural” computation of decisions in optimization problems. Biological Cybernetics 52(3):141–152, ISSN 0340-1200, URL http://dx.doi.org/10.1007/BF00339943.
- Kingma DP, Ba J (2014) Adam: A Method for Stochastic Optimization .
- Krizhevsky A (2014) One weird trick for parallelizing convolutional neural networks .
- Lin S, Kernighan BW (1973) An effective heuristic algorithm for the traveling-salesman problem. Operations research 21(2):498–516.
- Loshchilov I, Hutter F (2016) SGDR: Stochastic Gradient Descent with Warm Restarts .
- Lowe DG (2004) Distinctive image features from scale-invariant keypoints. International journal of computer vision 60:91–110.
- Lowerre BT (1976) The HARPY speech recognition system .
- Shaw P (1998) Using constraint programming and local search methods to solve vehicle routing problems. International conference on principles and practice of constraint programming, 417–431.
- Toth P, Vigo D (2002) The vehicle routing problem (SIAM).
- Williams RJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning 8(3):229–256.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.