Pointer Networks Trained Better via Evolutionary Algorithms (2312.01150v4)
Abstract: Pointer Network (PtrNet) is a specific neural network for solving Combinatorial Optimization Problems (COPs). While PtrNets offer real-time feed-forward inference for complex COPs instances, its quality of the results tends to be less satisfactory. One possible reason is that such issue suffers from the lack of global search ability of the gradient descent, which is frequently employed in traditional PtrNet training methods including both supervised learning and reinforcement learning. To improve the performance of PtrNet, this paper delves deeply into the advantages of training PtrNet with Evolutionary Algorithms (EAs), which have been widely acknowledged for not easily getting trapped by local optima. Extensive empirical studies based on the Travelling Salesman Problem (TSP) have been conducted. Results demonstrate that PtrNet trained with EA can consistently perform much better inference results than eight state-of-the-art methods on various problem scales. Compared with gradient descent based PtrNet training methods, EA achieves up to 30.21\% improvement in quality of the solution with the same computational time. With this advantage, this paper is able to at the first time report the results of solving 1000-dimensional TSPs by training a PtrNet on the same dimensionality, which strongly suggests that scaling up the training instances is in need to improve the performance of PtrNet on solving higher-dimensional COPs.
- Pointer Networks. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc., 2015.
- Stage-Wise Magnitude-Based Pruning for Recurrent Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, pages 1–15, 2022.
- Solving Traveling Salesman Problem with Time Windows Using Hybrid Pointer Networks with Time Features. Sustainability, 13(22):12906, January 2021.
- Moshe Sniedovich. Dynamic Programming: Foundations and Principles. CRC press, 2010.
- P. Festa. A brief introduction to exact, approximation, and heuristic algorithms for solving hard combinatorial optimization problems. In 2014 16th International Conference on Transparent Optical Networks (ICTON), pages 1–20, July 2014.
- Energy management in hybrid microgrid with considering multiple power market and real time demand response. Energy, 174:10–23, May 2019.
- Joint resource allocation and trajectory design for multi-uav systems with moving users: Pointer network and unfolding. IEEE Transactions on Wireless Communications, 22(5):3310–3323, 2023.
- HierRL: Hierarchical Reinforcement Learning for Task Scheduling in Distributed Systems. In 2022 International Joint Conference on Neural Networks (IJCNN), pages 1–8, July 2022.
- Chenxu Shi. Pointer Network Solution Pool : Combining Pointer Networks and Heuristics to Solve TSP Problems. In 2022 3rd International Conference on Computer Vision, Image and Deep Learning & International Conference on Computer Engineering and Applications (CVIDL & ICCEA), pages 1236–1242, May 2022.
- Deep Reinforcement Learning for the Electric Vehicle Routing Problem With Time Windows. IEEE Transactions on Intelligent Transportation Systems, 23(8):11528–11538, August 2022.
- Learning to Perform Local Rewriting for Combinatorial Optimization. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- A Pointer Network Based Deep Learning Algorithm for the Max-Cut Problem. In Long Cheng, Andrew Chi Sing Leung, and Seiichi Ozawa, editors, Neural Information Processing, Lecture Notes in Computer Science, pages 238–248, Cham, 2018. Springer International Publishing.
- How Good Is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem, April 2023.
- Keld Helsgaun. An effective implementation of the Lin–Kernighan traveling salesman heuristic. European Journal of Operational Research, 126(1):106–130, October 2000.
- Combinatorial optimization by graph pointer networks and hierarchical reinforcement learning. arXiv preprint arXiv:1911.04936, 2019.
- Attention is not all you need: Pure attention loses rank doubly exponentially with depth. In International Conference on Machine Learning, pages 2793–2803. PMLR, 2021.
- POMO: Policy Optimization with Multiple Optima for Reinforcement Learning. In Advances in Neural Information Processing Systems, volume 33, pages 21188–21198. Curran Associates, Inc., 2020.
- Neural combinatorial optimization with reinforcement learning. arXiv preprint arXiv:1611.09940, 2016.
- Attention, Learn to Solve Routing Problems!, February 2019.
- Heterogeneous Pointer Network for Travelling Officer Problem. In 2022 International Joint Conference on Neural Networks (IJCNN), pages 1–8, July 2022.
- Reinforcement Learning for Solving the Vehicle Routing Problem. In Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
- A Gradient-Guided Evolutionary Approach to Training Deep Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, 33(9):4861–4875, September 2022.
- X. Zhao and S. B. Holden. Towards a Competitive 3-Player Mahjong AI using Deep Reinforcement Learning. In 2022 IEEE Conference on Games (CoG), pages 524–527, 21.
- Hierarchical Neural Architecture Search for Deep Stereo Matching. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 22158–22169. Curran Associates, Inc., 2020.
- A Survey on Evolutionary Construction of Deep Neural Networks. IEEE Transactions on Evolutionary Computation, 25(5):894–912, October 2021.
- Natural evolution strategies. The Journal of Machine Learning Research, 15(1):949–980, 2014.
- Negatively correlated search. IEEE Journal on Selected Areas in Communications, 34(3):542–550, 2016.
- Parallel exploration via negatively correlated search. Frontiers of Computer Science, 15(5):155333, July 2021.
- A Scalable Approach to Capacitated Arc Routing Problems Based on Hierarchical Decomposition. IEEE Transactions on Cybernetics, 47(11):3928–3940, November 2017.
- Evaluating Curriculum Learning Strategies in Neural Combinatorial Optimization, November 2020.
- Kenji Kawaguchi. Deep Learning without Poor Local Minima. In Advances in Neural Information Processing Systems, volume 29. Curran Associates, Inc., 2016.
- “Neural” computation of decisions in optimization problems. Biological Cybernetics, 52(3):141–152, July 1985.
- J. Wang. A recurrent neural network for solving the shortest path problem. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, 43(6):482–486, June 1996.
- A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 42(6):1291–1307, November 2012.
- Deep Reinforcement Learning for Multiobjective Optimization. IEEE Transactions on Cybernetics, 51(6):3103–3114, June 2021.
- Xin Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9):1423–1447, September 1999.
- Search Heuristics for the Optimization of DBN for Time Series Forecasting. In Hitoshi Iba and Nasimul Noman, editors, Deep Neural Evolution: Deep Learning with Evolutionary Computation, pages 131–152. Springer Singapore, Singapore, 2020.
- Discovering Gated Recurrent Neural Network Architectures. In Hitoshi Iba and Nasimul Noman, editors, Deep Neural Evolution: Deep Learning with Evolutionary Computation, pages 233–251. Springer Singapore, Singapore, 2020.
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning, September 2017.
- Evolutionary reinforcement learning via cooperative coevolutionary negatively correlated search. Swarm and Evolutionary Computation, 68:100974, 2022.
- Evolutionary job scheduling with optimized population by deep reinforcement learning. Engineering Optimization, 55(3):494–509, March 2023.
- Accelerating the Genetic Algorithm for Large-scale Traveling Salesman Problems by Cooperative Coevolutionary Pointer Network with Reinforcement Learning. arXiv preprint arXiv:2209.13077, 2022.
- Multi-Objective Neural Evolutionary Algorithm for Combinatorial Optimization Problems. IEEE Transactions on Neural Networks and Learning Systems, 34(4):2133–2143, April 2023.
- Evolutionary programming made faster. IEEE Transactions on Evolutionary Computation, 3(2):82–102, July 1999.
- T. Kailath. The Divergence and Bhattacharyya Distance Measures in Signal Selection. IEEE Transactions on Communication Technology, 15(1):52–60, February 1967.
- Evolution strategies – A comprehensive introduction. Natural Computing, 1(1):3–52, March 2002.
- Reinforcement learning for combinatorial optimization: A survey. Computers & Operations Research, 134:105400, 2021.
- Learning Combinatorial Optimization Algorithms over Graphs. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017.
- Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem. Proceedings of the AAAI Conference on Artificial Intelligence, 35(14):12445–12452, May 2021.
- Local search in combinatorial optimization. Artificial Neural Networks: An Introduction to ANN Theory and Practice, pages 157–174, 2005.
- Rafał Skinderowicz. Improving Ant Colony Optimization efficiency for solving large TSP instances. Applied Soft Computing, 120:108653, May 2022.