Constrained Multi-objective Optimization with Deep Reinforcement Learning Assisted Operator Selection (2402.12381v1)
Abstract: Solving constrained multi-objective optimization problems with evolutionary algorithms has attracted considerable attention. Various constrained multi-objective optimization evolutionary algorithms (CMOEAs) have been developed with the use of different algorithmic strategies, evolutionary operators, and constraint-handling techniques. The performance of CMOEAs may be heavily dependent on the operators used, however, it is usually difficult to select suitable operators for the problem at hand. Hence, improving operator selection is promising and necessary for CMOEAs. This work proposes an online operator selection framework assisted by Deep Reinforcement Learning. The dynamics of the population, including convergence, diversity, and feasibility, are regarded as the state; the candidate operators are considered as actions; and the improvement of the population state is treated as the reward. By using a Q-Network to learn a policy to estimate the Q-values of all actions, the proposed approach can adaptively select an operator that maximizes the improvement of the population according to the current state and thereby improve the algorithmic performance. The framework is embedded into four popular CMOEAs and assessed on 42 benchmark problems. The experimental results reveal that the proposed Deep Reinforcement Learning-assisted operator selection significantly improves the performance of these CMOEAs and the resulting algorithm obtains better versatility compared to nine state-of-the-art CMOEAs.
- A. Kumar, G. Wu, M. Z. Ali, Q. Luo, R. Mallipeddi, P. N. Suganthan, and S. Das, “A benchmark-suite of real-world constrained multi-objective optimization problems and some baseline results,” Swarm and Evolutionary Computation, vol. 67, p. 100961, 2021.
- B. Tan, H. Ma, Y. Mei, and M. Zhang, “Evolutionary multi-objective optimization for web service location allocation problem,” IEEE Transactions on Services Computing, vol. 14, no. 2, pp. 458–471, 2021.
- Z. Ma and Y. Wang, “Shift-based penalty for evolutionary constrained multiobjective optimization and its application,” IEEE Transactions on Cybernetics, pp. 1–13, 2021, doi:10.1109/TCYB.2021.3069814.
- Y. Tian, T. Zhang, J. Xiao, X. Zhang, and Y. Jin, “A coevolutionary framework for constrained multiobjective optimization problems,” IEEE Transactions on Evolutionary Computation, vol. 25, no. 1, pp. 102–116, 2021.
- Y. Tian, Y. Zhang, Y. Su, X. Zhang, K. C. Tan, and Y. Jin, “Balancing objective optimization and constraint satisfaction in constrained evolutionary multiobjective optimization,” IEEE Transactions on Cybernetics, vol. 52, no. 9, pp. 9559–9572, 2022.
- R. Jiao, B. Xue, and M. Zhang, “A multiform optimization framework for constrained multiobjective optimization,” IEEE Transactions on Cybernetics, pp. 1–13, 2022, doi:10.1109/TCYB.2022.3178132.
- K. Qiao, K. Yu, B. Qu, J. Liang, H. Song, and C. Yue, “An evolutionary multitasking optimization framework for constrained multiobjective optimization problems,” IEEE Transactions on Evolutionary Computation, vol. 26, no. 2, pp. 263–277, 2022.
- Z. Ma, Y. Wang, and W. Song, “A new fitness function with two rankings for evolutionary constrained multiobjective optimization,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 51, no. 8, pp. 5005–5016, 2021.
- Y. Tian, X. Zhang, C. He, K. Tan, and Y. Jin, “Principled design of translation, scale, and rotation invariant variation operators for metaheuristics,” Chinese Journal of Electronics, 07 2022.
- J. Holland, “Adaptation in natural and artificial systems,” An Introductory Analysis with Application to Biology, Control and Artificial Intelligence, 01 1994.
- R. Storn and K. Price, “Differential evolution - a simple and efficient heuristic for global optimization over continuous spaces,” Journal of Global Optimization, vol. 11, pp. 341–359, 01 1997.
- R. Eberhart and J. Kennedy, “A new optimizer using particle swarm theory,” in MHS’95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science, 1995, pp. 39–43.
- X. Zhang, X. Zheng, R. Cheng, J. Qiu, and Y. Jin, “A competitive mechanism based multi-objective particle swarm optimizer with fast convergence,” Information Sciences, vol. 427, pp. 63–76, 2018.
- L. Hu, Y. Yang, Z. Tang, Y. He, and X. Luo, “Fcan-mopso: An improved fuzzy-based graph clustering algorithm for complex networks with multi-objective particle swarm optimization,” IEEE Transactions on Fuzzy Systems, pp. 1–16, 2023.
- L. Hu, K. C. C. Chan, X. Yuan, and S. Xiong, “A variational bayesian framework for cluster analysis in a complex network,” IEEE Transactions on Knowledge and Data Engineering, vol. 32, no. 11, pp. 2115–2128, 2020.
- C. Wang, R. Xu, and X. Zhang, “An evolutionary algorithm based on multi-operator ensemble for multi-objective optimization,” in Intelligent Computing Theories and Application, D.-S. Huang, V. Bevilacqua, and P. Premaratne, Eds. Cham: Springer International Publishing, 2019, pp. 14–24.
- Y. Tian, X. Li, H. Ma, X. Zhang, K. C. Tan, and Y. Jin, “Deep reinforcement learning based adaptive operator selection for evolutionary multi-objective optimization,” IEEE Transactions on Emerging Topics in Computational Intelligence, pp. 1–14, 2022, doi:10.1109/TETCI.2022.3146882.
- L. Dong, Q. Lin, Y. Zhou, and J. Jiang, “Adaptive operator selection with test-and-apply structure for decomposition-based multi-objective optimization,” Swarm and Evolutionary Computation, vol. 68, p. 101013, 2022.
- S. Schneider, R. Khalili, A. Manzoor, H. Qarawlus, R. Schellenberg, H. Karl, and A. Hecker, “Self-learning multi-objective service coordination using deep reinforcement learning,” IEEE Transactions on Network and Service Management, vol. 18, no. 3, pp. 3829–3842, 2021.
- L. Caviglione, M. Gaggero, M. Paolucci, and R. Ronco, “Deep reinforcement learning for multi-objective placement of virtual machines in cloud datacenters,” Soft Computing, vol. 25, pp. 1–20, 10 2021.
- W. Liu, R. Wang, T. Zhang, K. Li, W. Li, and H. Ishibuchi, “Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization,” IEEE Transactions on Evolutionary Computation, pp. 1–1, 2022, doi:10.1109/TEVC.2022.3199045.
- Y. Li, G. Hao, Y. Liu, Y. Yu, Z. Ni, and Y. Zhao, “Many-objective distribution network reconfiguration via deep reinforcement learning assisted optimization algorithm,” IEEE Transactions on Power Delivery, vol. 37, no. 3, pp. 2230–2244, 2022.
- F. Zhao, S. Di, and L. Wang, “A hyperheuristic with q-learning for the multiobjective energy-efficient distributed blocking flow shop scheduling problem,” IEEE Transactions on Cybernetics, vol. 53, no. 5, pp. 3337–3350, 2023.
- K. Li, T. Zhang, and R. Wang, “Deep reinforcement learning for multiobjective optimization,” IEEE Transactions on Cybernetics, vol. 51, no. 6, pp. 3103–3114, 2021.
- Z. Zhang, Z. Wu, H. Zhang, and J. Wang, “Meta-learning-based deep reinforcement learning for multiobjective optimization problems,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–14, 2022, doi:10.1109/TNNLS.2022.3148435.
- Z. Fan, W. Li, X. Cai, H. Li, C. Wei, Q. Zhang, K. Deb, and E. Goodman, “Push and pull search for solving constrained multi-objective optimization problems,” Swarm and Evolutionary Computation, vol. 44, pp. 665–679, 2019.
- Q. Zhu, Q. Zhang, and Q. Lin, “A constrained multiobjective evolutionary algorithm with detect-and-escape strategy,” IEEE Transactions on Evolutionary Computation, vol. 24, no. 5, pp. 938–947, 2020.
- A. Santiago, B. Dorronsoro, H. J. Fraire, and P. Ruiz, “Micro-genetic algorithm with fuzzy selection of operators for multi-objective optimization: μ𝜇\muitalic_μfame,” Swarm and Evolutionary Computation, vol. 61, p. 100818, 2021.
- Y. Yuan, H. Xu, and B. Wang, “An experimental investigation of variation operators in reference-point based many-objective optimization,” in Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation, ser. GECCO ’15. New York, NY, USA: Association for Computing Machinery, 2015, p. 775–782.
- K. McClymont and E. C. Keedwell, “Markov chain hyper-heuristic (mchh): An online selective hyper-heuristic for multi-objective continuous problems,” in Proceedings of the 13th Annual Conference on Genetic and Evolutionary Computation, ser. GECCO ’11. New York, NY, USA: Association for Computing Machinery, 2011, p. 2003–2010.
- A. Lin, P. Yu, S. Cheng, and L. Xing, “One-to-one ensemble mechanism for decomposition-based multi-objective optimization,” Swarm and Evolutionary Computation, vol. 68, p. 101007, 2022.
- R. Sutton and A. Barto, “Reinforcement learning: An introduction,” IEEE Transactions on Neural Networks, vol. 9, no. 5, pp. 1054–1054, 1998.
- C. Watkins and P. Dayan, “Technical note: Q-learning,” Machine Learning, vol. 8, pp. 279–292, 05 1992.
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Rusu, J. Veness, M. Bellemare, A. Graves, M. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, “Human-level control through deep reinforcement learning,” Nature, vol. 518, pp. 529–33, 02 2015.
- P. Kerschke, H. H. Hoos, F. Neumann, and H. Trautmann, “Automated Algorithm Selection: Survey and Perspectives,” Evolutionary Computation, vol. 27, no. 1, pp. 3–45, 03 2019.
- W. Gong, Z. Cai, and D. Liang, “Adaptive ranking mutation operator based differential evolution for constrained optimization,” IEEE Transactions on Cybernetics, vol. 45, no. 4, pp. 716–727, 2015.
- Q. Zhang, A. Zhou, S. Zhao, P. Suganthan, W. Liu, and S. Tiwari, “Multiobjective optimization test instances for the cec 2009 special session and competition,” Mechanical Engineering, 01 2008.
- Z. Fan, W. Li, X. Cai, H. Li, C. Wei, Q. Zhang, K. Deb, and E. Goodman, “Difficulty adjustable and scalable constrained multiobjective test problem toolkit,” Evolutionary Computation, vol. 28, no. 3, pp. 339–378, 2020.
- Z. Liu and Y. Wang, “Handling constrained multiobjective optimization problems with constraints in both the decision and objective spaces,” IEEE Transactions on Evolutionary Computation, vol. 23, no. 5, pp. 870–884, 2019.
- Z. Fan, W. Li, X. Cai, H. Huang, Y. Fang, Y. Yugen, J. Mo, C. Wei, and E. Goodman, “An improved epsilon constraint-handling method in moea/d for cmops with large infeasible regions,” Soft Computing, vol. 23, 12 2019.
- M. Ming, A. Trivedi, R. Wang, D. Srinivasan, and T. Zhang, “A dual-population-based evolutionary algorithm for constrained multiobjective optimization,” IEEE Transactions on Evolutionary Computation, vol. 25, no. 4, pp. 739–753, 2021.
- Z.-Z. Liu, B.-C. Wang, and K. Tang, “Handling constrained multiobjective optimization problems via bidirectional coevolution,” IEEE Transactions on Cybernetics, vol. 52, no. 10, pp. 10 163–10 176, 2022.
- M. Li, S. Yang, and X. Liu, “Shift-based density estimation for pareto-based algorithms in many-objective optimization,” IEEE Transactions on Evolutionary Computation, vol. 18, no. 3, pp. 348–365, 2014.
- K. Yu, J. Liang, B. Qu, Y. Luo, and C. Yue, “Dynamic selection preference-assisted constrained multiobjective differential evolution,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 5, pp. 2954–2965, 2022.
- J. Yuan, H.-L. Liu, and Z. He, “A constrained multi-objective evolutionary algorithm using valuable infeasible solutions,” Swarm and Evolutionary Computation, vol. 68, p. 101020, 2022.
- R. Agrawal, K. Deb, and R. Agrawal, “Simulated binary crossover for continuous search space,” Complex Systems, vol. 9, pp. 115–148, 06 2000.
- H. Ishibuchi, R. Imada, N. Masuyama, and Y. Nojima, “Comparison of hypervolume, igd and igd+ from the viewpoint of optimal distributions of solutions,” in Evolutionary Multi-Criterion Optimization, 01 2019, pp. 332–345.
- H. Ishibuchi, L. M. Pang, and K. Shang, “Difficulties in fair performance comparison of multi-objective evolutionary algorithms [research frontier],” IEEE Computational Intelligence Magazine, vol. 17, no. 1, pp. 86–101, 2022.
- J. Alcalá-Fdez, L. Sánchez, S. García, M. J. del Jesus, S. Ventura, J. M. Garrell, J. Otero, C. Romero, J. Bacardit, V. M. Rivas, J. C. Fernández, and F. Herrera, “KEEL: A software tool to assess evolutionary algorithms for data mining problems,” Soft Comput., vol. 13, no. 3, pp. 307–318, 2009.
- J. Schott, “Fault tolerant design using single and multicriteria genetic algorithm optimization,” p. 203, 05 1995.
- Y. Tian, H. Chen, H. Ma, X. Zhang, K. C. Tan, and Y. Jin, “Integrating conjugate gradients into evolutionary algorithms for large-scale continuous multi-objective optimization,” IEEE/CAA Journal of Automatica Sinica, vol. 9, no. 10, pp. 1801–1817, 2022.
- Y. Tian, Y. Feng, X. Zhang, and C. Sun, “A fast clustering based evolutionary algorithm for super-large-scale sparse multi-objective optimization,” IEEE/CAA Journal of Automatica Sinica, vol. 10, no. 4, pp. 1048–1063, 2023.
- H. Li, J. Li, P. Wu, Y. You, and N. Zeng, “A ranking-system-based switching particle swarm optimizer with dynamic learning strategies,” Neurocomputing, vol. 494, pp. 356–367, 2022.
- N. Zeng, Z. Wang, W. Liu, H. Zhang, K. Hone, and X. Liu, “A dynamic neighborhood-based switching particle swarm optimization algorithm,” IEEE Transactions on Cybernetics, vol. 52, no. 9, pp. 9290–9301, 2022.
- D. Guo, X. Wang, K. Gao, Y. Jin, J. Ding, and T. Chai, “Evolutionary optimization of high-dimensional multiobjective and many-objective expensive problems assisted by a dropout neural network,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 4, pp. 2084–2097, 2022.
- X. Luo, Y. Yuan, S. Chen, N. Zeng, and Z. Wang, “Position-transitional particle swarm optimization-incorporated latent factor analysis,” IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 8, pp. 3958–3970, 2022.
- Fei Ming (14 papers)
- Wenyin Gong (3 papers)
- Ling Wang (89 papers)
- Yaochu Jin (108 papers)