
Pointer Networks Trained Better via Evolutionary Algorithms (2312.01150v4)

Published 2 Dec 2023 in cs.NE

Abstract: The Pointer Network (PtrNet) is a neural network designed for solving Combinatorial Optimization Problems (COPs). While PtrNets offer real-time feed-forward inference for complex COP instances, the quality of their results tends to be unsatisfactory. One possible reason is the limited global search ability of gradient descent, which is frequently employed in traditional PtrNet training methods, including both supervised learning and reinforcement learning. To improve the performance of PtrNet, this paper investigates the advantages of training PtrNet with Evolutionary Algorithms (EAs), which are widely acknowledged for not easily getting trapped in local optima. Extensive empirical studies based on the Travelling Salesman Problem (TSP) show that a PtrNet trained with an EA consistently produces much better inference results than eight state-of-the-art methods across various problem scales. Compared with gradient-descent-based PtrNet training methods, the EA achieves up to a 30.21% improvement in solution quality within the same computational time. Leveraging this advantage, this paper reports, for the first time, results of solving 1000-dimensional TSPs by training a PtrNet on the same dimensionality, which strongly suggests that scaling up the training instances is necessary to improve PtrNet performance on higher-dimensional COPs.


Summary

  • The paper demonstrates that training pointer networks with evolutionary algorithms avoids local optima and boosts performance.
  • It applies a Negatively Correlated Search (NCS) method to handle large-scale instances, achieving up to a 30.21% improvement over gradient descent.
  • The study highlights that increasing EA population size enhances solution diversity and scalability for complex combinatorial problems.

Introduction

Pointer Networks (PtrNets) are specialized neural networks designed to address NP-hard Combinatorial Optimization Problems (COPs). They offer real-time feed-forward inference, making them attractive compared with traditional iterative solvers. However, PtrNets are typically trained with gradient descent methods, which come with a limitation: they easily get trapped in local optima. This paper proposes using Evolutionary Algorithms (EAs) to train PtrNets, exploiting their global search capability and propensity to avoid local optima.
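To make the training objective concrete, here is a minimal sketch (not code from the paper) of the tour-length fitness that an evolved PtrNet would be evaluated on for the TSP: the network outputs a permutation of the cities, and the total length of the resulting closed tour is the quantity to minimize.

```python
import math

def tour_length(tour, coords):
    """Total Euclidean length of a closed TSP tour.

    tour   -- a permutation of city indices, e.g. [0, 2, 1, 3]
    coords -- list of (x, y) coordinates, one per city
    """
    total = 0.0
    for i in range(len(tour)):
        x1, y1 = coords[tour[i]]
        # Wrap around so the tour returns to its starting city.
        x2, y2 = coords[tour[(i + 1) % len(tour)]]
        total += math.hypot(x2 - x1, y2 - y1)
    return total
```

For a unit square visited in order, the tour length is exactly 4; visiting the corners out of order is strictly longer, which is the signal a training method (gradient-based or evolutionary) optimizes against.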

Evolutionary Training Advantages

EAs differ significantly from gradient descent approaches. EAs perform searches using populations of solutions, applying operators like mutation and crossover to evolve these solutions over time. This method has two key advantages in training neural networks:

  • Global Search Capability: EAs can potentially find better overall solutions by exploring various regions of the solution space simultaneously, rather than focusing on a single gradient path.
  • Diverse Search: Because EAs use populations, they naturally maintain a diversity of possible solutions, which can prevent premature convergence to suboptimal points.
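The population-based search described above can be sketched in a few lines. The following is a generic (mu+lambda)-style evolutionary loop over real-valued parameter vectors (standing in for network weights), minimizing an arbitrary fitness function; it illustrates the mutation-then-selection cycle, not the paper's exact training procedure.

```python
import random

def evolve(fitness, dim, pop_size=20, generations=100, sigma=0.1, seed=0):
    """Minimize `fitness` over dim-dimensional real vectors with a
    simple (mu + lambda) evolutionary loop. Toy illustration only."""
    rng = random.Random(seed)
    # Initialize a population of random parameter vectors.
    pop = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(generations):
        # Mutation: each parent produces one Gaussian-perturbed offspring.
        offspring = [[w + rng.gauss(0, sigma) for w in ind] for ind in pop]
        # Selection: keep the best pop_size individuals from parents + offspring.
        pop = sorted(pop + offspring, key=fitness)[:pop_size]
    return pop[0]
```

Because selection considers the whole population rather than following a single gradient path, distinct regions of the parameter space are explored in parallel, which is the global-search property the paper exploits.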

Empirical Evaluation

To assess the benefits of EAs, the researchers conducted extensive experiments with a PtrNet trained using an EA called Negatively Correlated Search (NCS) to solve the Traveling Salesman Problem (TSP). They explored instances with up to 1000 nodes, which is challenging for PtrNets traditionally trained on much smaller problems. The experimental results revealed:

  • PtrNet trained with EA (PtrNet-EA) achieved up to 30.21% improvement over gradient descent-based methods.
  • Training PtrNet on the same dimensionality as the problem instances (in this case, 1000-dimensional TSPs) led to significant performance gains.
  • Increasing the population size within the EA improved the performance further, showcasing the scalability and efficiency benefits of parallelized EAs.
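The distinguishing feature of NCS is that each individual's search is biased away from where the rest of the population is searching. The snippet below is a much simplified, hypothetical illustration of that quality/diversity trade-off (an offspring replaces its parent when it improves a combined score of fitness and distance to the other individuals); it is not the exact NCS acceptance rule from the paper.

```python
import math
import random

def ncs_step(pop, fitness, sigma, lam, rng):
    """One generation of a toy negatively-correlated search.

    pop     -- list of real-valued vectors
    fitness -- function to minimize
    sigma   -- mutation step size
    lam     -- weight of the diversity term (larger = more exploration)
    """
    def nearest_dist(x, others):
        # Distance to the closest other individual: a crude diversity measure.
        return min(math.dist(x, o) for o in others)

    new_pop = []
    for i, parent in enumerate(pop):
        child = [w + rng.gauss(0, sigma) for w in parent]
        others = [p for j, p in enumerate(pop) if j != i]
        # Lower fitness is better; being far from neighbours is rewarded.
        parent_score = fitness(parent) - lam * nearest_dist(parent, others)
        child_score = fitness(child) - lam * nearest_dist(child, others)
        new_pop.append(child if child_score < parent_score else parent)
    return new_pop
```

With a larger population, more regions of the weight space are covered per generation and the updates are independent across individuals, which is why the paper observes that scaling up the population (and parallelizing it) improves results.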

Conclusions and Implications

By demonstrating that EAs can effectively train PtrNets to outperform gradient descent techniques on larger-scale problems, this paper paves the way for more robust neural solvers in the field of optimization. It suggests that future applications of PtrNets and other similar neural models can benefit from training via EAs, especially for large and complex COPs where traditional methods struggle. The results also indicate that for problems like the TSP, using a training set of the same scale as the test cases is crucial for reaching high-quality solutions.