Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash 78 tok/s
Gemini 2.5 Pro 43 tok/s Pro
GPT-5 Medium 23 tok/s
GPT-5 High 29 tok/s Pro
GPT-4o 93 tok/s
GPT OSS 120B 470 tok/s Pro
Kimi K2 183 tok/s Pro
2000 character limit reached

Online Control of Adaptive Large Neighborhood Search using Deep Reinforcement Learning (2211.00759v3)

Published 1 Nov 2022 in cs.LG and cs.AI

Abstract: The Adaptive Large Neighborhood Search (ALNS) algorithm has shown considerable success in solving combinatorial optimization problems (COPs). Nonetheless, the performance of ALNS relies on the proper configuration of its selection and acceptance parameters, which is known to be a complex and resource-intensive task. To address this, we introduce a Deep Reinforcement Learning (DRL) based approach called DR-ALNS that selects operators, adjusts parameters, and controls the acceptance criterion throughout the search. The proposed method aims to learn, based on the state of the search, to configure ALNS for the next iteration to yield more effective solutions for the given optimization problem. We evaluate the proposed method on an orienteering problem with stochastic weights and time windows, as presented in an IJCAI competition. The results show that our approach outperforms vanilla ALNS, ALNS tuned with Bayesian optimization, and two state-of-the-art DRL approaches that were the winning methods of the competition, achieving this with significantly fewer training observations. Furthermore, we demonstrate several good properties of the proposed DR-ALNS method: it is easily adapted to solve different routing problems, its learned policies perform consistently well across various instance sizes, and these policies can be directly applied to different problem variants.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Learning 2-Opt Heuristics for Routing Problems via Deep Reinforcement Learning. SN Computer Science, 2(5): 1–16.
  2. A reinforcement learning approach to the orienteering problem with time windows. Computers & Operations Research, 133: 105357.
  3. Learn to design the heuristics for vehicle routing problem. arXiv preprint arXiv:2002.08539.
  4. A hybrid adaptive large neighborhood search heuristic for the team orienteering problem. Computers & Operations Research, 123: 105034.
  5. Neural Large Neighborhood Search for the Capacitated Vehicle Routing Problem. In ECAI 2020, 443–450. IOS Press.
  6. Deep reinforcement learning approach to solve dynamic vehicle routing problem with stochastic customers. In Proceedings of the international Conference on Automated Planning and Scheduling, volume 30, 394–402.
  7. Graph Reinforcement Learning for Operator Selection in the ALNS Metaheuristic. arXiv preprint arXiv:2302.14678.
  8. A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems. European Journal of Operational Research, 309(1): 446–468.
  9. Large-state reinforcement learning for hyper-heuristics. In Proceedings of the AAAI Conference on Artificial Intelligence, 12444–12452.
  10. Attention, learn to solve routing problems! arXiv preprint arXiv:1803.08475.
  11. Pomo: Policy optimization with multiple optima for reinforcement learning. Advances in Neural Information Processing Systems, 33: 21188–21198.
  12. SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization. J. Mach. Learn. Res., 23(54): 1–9.
  13. The irace package: Iterated racing for automatic algorithm configuration. Operations Research Perspectives, 3: 43–58.
  14. A survey of adaptive large neighborhood search algorithms and applications. Computers & Operations Research, 146: 105903.
  15. A solution approach to the orienteering problem with time windows and synchronisation constraints. Heliyon, 6(6): e04202.
  16. An adaptive large neighbourhood search for asset protection during escaped wildfires. Computers & Operations Research, 97: 125–134.
  17. An adaptive large neighborhood search heuristic for the pickup and delivery problem with time windows. Transportation science, 40(4): 455–472.
  18. A unified heuristic for a large class of vehicle routing problems with backhauls. European Journal of Operational Research, 171(3): 750–775.
  19. A comparison of acceptance criteria for the adaptive large neighbourhood search metaheuristic. Journal of Heuristics, 24: 783–815.
  20. Learning to Solve a Stochastic Orienteering Problem with Time Windows. In Learning and Intelligent Optimization: 16th International Conference, LION 16, 108–122. Springer.
  21. Record breaking optimization results using the ruin and recreate principle. Journal of Computational Physics, 159(2): 139–171.
  22. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347.
  23. Shaw, P. 1998. Using constraint programming and local search methods to solve vehicle routing problems. In International conference on principles and practice of constraint programming, 417–431. Springer.
  24. Learning a large neighborhood search algorithm for mixed integer programs. arXiv preprint arXiv:2107.10201.
  25. Neural network based large neighborhood search algorithm for ride hailing services. In EPIA Conference on Artificial Intelligence, 584–595. Springer.
  26. Solving the stochastic time-dependent orienteering problem with time windows. European Journal of Operational Research, 255(3): 699–718.
  27. Learning large neighborhood search policy for integer programming. Advances in Neural Information Processing Systems, 34: 30075–30087.
  28. The clustered team orienteering problem. Computers & Operations Research, 111: 386–399.
  29. The first AI4TSP competition: Learning to solve stochastic routing problems. Artificial Intelligence, 319: 103918.
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Ai Generate Text Spark Streamline Icon: https://streamlinehq.com

Paper Prompts

Sign up for free to create and run prompts on this paper using GPT-5.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube