Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning (2305.05119v2)
Abstract: Flexible manufacturing has given rise to complex scheduling problems such as the flexible job shop scheduling problem (FJSP). In FJSP, operations can be processed on multiple machines, leading to intricate relationships between operations and machines. Recent works have employed deep reinforcement learning (DRL) to learn priority dispatching rules (PDRs) for solving FJSP. However, the quality of solutions still has room for improvement relative to that by the exact methods such as OR-Tools. To address this issue, this paper presents a novel end-to-end learning framework that weds the merits of self-attention models for deep feature extraction and DRL for scalable decision-making. The complex relationships between operations and machines are represented precisely and concisely, for which a dual-attention network (DAN) comprising several interconnected operation message attention blocks and machine message attention blocks is proposed. The DAN exploits the complicated relationships to construct production-adaptive operation and machine features to support high-quality decisionmaking. Experimental results using synthetic data as well as public benchmarks corroborate that the proposed approach outperforms both traditional PDRs and the state-of-the-art DRL method. Moreover, it achieves results comparable to exact methods in certain cases and demonstrates favorable generalization ability to large-scale and real-world unseen FJSP tasks.
- S. Mao, B. Wang, Y. Tang, and F. Qian, “Opportunities and challenges of artificial intelligence for green manufacturing in the process industry,” Eng., vol. 5, no. 6, pp. 995–1002, Dec. 2019.
- J. Chen, J. Sun, and G. Wang, “From unmanned systems to autonomous intelligent systems,” Eng., vol. 12, no. 5, pp. 16–19, 2022.
- K. Ding, F. T. Chan, X. Zhang, G. Zhou, and F. Zhang, “Defining a digital twin-based cyber-physical production system for autonomous manufacturing in smart shop floors,” Int. J. Prod. Res., vol. 57, no. 20, pp. 6315–6334, Jan. 2019.
- A. Arunarani, D. Manjula, and V. Sugumaran, “Task scheduling techniques in cloud computing: A literature survey,” Future Gener. Comput. Syst., vol. 91, pp. 407–415, Feb. 2019.
- S. Satunin and E. Babkin, “A multi-agent approach to intelligent transportation systems modeling with combinatorial auctions,” Expert Syst. Appl., vol. 41, no. 15, pp. 6622–6633, Nov. 2014.
- J. Zhang, G. Ding, Y. Zou, S. Qin, and J. Fu, “Review of job shop scheduling research and its new perspectives under Industry 4.0,” J. Intell. Manuf., vol. 30, pp. 1809–1830, 2019.
- J. Xie, L. Gao, K. Peng, X. Li, and H. Li, “Review on flexible job shop scheduling,” IET Collab. Intell. Manuf., vol. 1, no. 3, pp. 67–77, Sep. 2019.
- L. Meng, C. Zhang, Y. Ren, B. Zhang, and C. Lv, “Mixed-integer linear programming and constraint programming formulations for solving distributed flexible job shop scheduling problem,” Comput. Ind. Eng., vol. 142, p. 106347, Apr. 2020.
- Y. Demir and S. K. İşleyen, “Evaluation of mathematical models for flexible job-shop scheduling problems,” Appl. Math. Modell., vol. 37, no. 3, pp. 977–988, Feb. 2013.
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, Feb. 2015.
- M. Wang and B. Xin, “A genetic algorithm for solving flexible flow shop scheduling problem with autonomous guided vehicles,” in IEEE Intl. Conf. Control Autom. IEEE, 2019, pp. 922–927.
- D. Rooyani and F. M. Defersha, “An efficient two-stage genetic algorithm for flexible job-shop scheduling,” IFAC-PapersOnLine, vol. 52, no. 13, pp. 2519–2524, 2019.
- G. Zhang, X. Shao, P. Li, and L. Gao, “An effective hybrid particle swarm optimization algorithm for multi-objective flexible job-shop scheduling problem,” Comput. Ind. Eng., vol. 56, no. 4, pp. 1309–1318, May 2009.
- W. Du, W. Zhong, Y. Tang, W. Du, and Y. Jin, “High-dimensional robust multi-objective optimization for order scheduling: A decision variable classification approach,” IEEE Trans. Ind. Inf., vol. 15, no. 1, pp. 293–304, May 2018.
- J.-Q. Li, M.-X. Song, L. Wang, P.-Y. Duan, Y.-Y. Han, H.-Y. Sang, and Q.-K. Pan, “Hybrid artificial bee colony algorithm for a parallel batching distributed flow-shop problem with deteriorating jobs,” IEEE Trans. Cybern., vol. 50, no. 6, pp. 2425–2439, Oct. 2019.
- R. Haupt, “A survey of priority rule-based scheduling,” Oper. Res. Spektrum, vol. 11, no. 1, pp. 3–16, Mar. 1989.
- L. Wang, Z. Pan, and J. Wang, “A review of reinforcement learning based intelligent optimization for manufacturing scheduling,” Compl. Syst. Model. Simul., vol. 1, no. 4, pp. 257–270, Dec. 2021.
- L. Wang, X. Hu, Y. Wang, S. Xu, S. Ma, K. Yang, Z. Liu, and W. Wang, “Dynamic job-shop scheduling in smart manufacturing using deep reinforcement learning,” Comput. Netw., vol. 190, p. 107969, May 2021.
- S. Luo, L. Zhang, and Y. Fan, “Real-time scheduling for dynamic partial-no-wait multiobjective flexible job shop by deep reinforcement learning,” IEEE Trans. Autom. Sci. Eng., vol. 19, no. 4, pp. 3020–3038, Aug. 2021.
- Y. Du, J. Li, C. Li, and P. Duan, “A reinforcement learning approach for flexible job shop scheduling problem with crane transportation and setup times,” IEEE Trans. Neural Netw. Learn. Syst., Oct. 2022, DOI: 10.1109/TNNLS.2022.3208942.
- P. Brandimarte, “Routing and scheduling in a flexible job shop by tabu search,” Ann. Oper. Res., vol. 41, no. 3, pp. 157–183, Sep. 1993.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Adv. Neural Inf. Process. Syst., vol. 30, 2017.
- M. Nazari, A. Oroojlooy, L. Snyder, and M. Takác, “Reinforcement learning for solving the vehicle routing problem,” Adv. Neural Inf. Process. Syst., vol. 31, 2018.
- H. V. H. W. Kool and M. Welling, “Attention, learn to solve routing problems!” Proc. Int. Conf. Learn. Represent., 2018.
- A. Mirhoseini, A. Goldie, M. Yazgan, J. W. Jiang, E. Songhori, S. Wang, Y.-J. Lee, E. Johnson, O. Pathak, A. Nazi et al., “A graph placement methodology for fast chip design,” Nature, vol. 594, no. 7862, pp. 207–212, June 2021.
- Y.-D. Kwon, J. Choo, I. Yoon, M. Park, D. Park, and Y. Gwon, “Matrix encoding networks for neural combinatorial optimization,” Adv. Neural Inf. Process. Syst., vol. 34, pp. 5138–5149, 2021.
- S. Manchanda, A. Mittal, A. Dhawan, S. Medya, S. Ranu, and A. Singh, “GCOMB: Learning budget-constrained combinatorial algorithms over billion-sized graphs,” Adv. Neural Inf. Process. Syst., vol. 33, pp. 20 000–20 011, 2020.
- C. Zhang, W. Song, Z. Cao, J. Zhang, P. S. Tan, and X. Chi, “Learning to dispatch for job shop scheduling via deep reinforcement learning,” Adv. Neural Inf. Process. Syst., vol. 33, pp. 1621–1632, 2020.
- J. Park, J. Chun, S. H. Kim, Y. Kim, and J. Park, “Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning,” Int. J. Prod. Res., vol. 59, no. 11, pp. 3360–3377, 2021.
- R. Chen, W. Li, and H. Yang, “A deep reinforcement learning framework based on an attention mechanism and disjunctive graph embedding for the job-shop scheduling problem,” IEEE Trans. Ind. Inf., vol. 19, no. 2, pp. 1322–1331, Apr. 2022.
- K. Lei, P. Guo, W. Zhao, Y. Wang, L. Qian, X. Meng, and L. Tang, “A multi-action deep reinforcement learning framework for flexible job-shop scheduling problem,” Expert Syst. Appl., vol. 205, p. 117796, Nov. 2022.
- W. Song, X. Chen, Q. Li, and Z. Cao, “Flexible job-shop scheduling via graph neural network and deep reinforcement learning,” IEEE Trans. Industr. Inform., vol. 19, no. 2, pp. 1600–1610, Feb. 2023.
- G. Huang, Z. Liu, G. Pleiss, L. Van Der Maaten, and K. Q. Weinberger, “Convolutional networks with dense connectivity,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 44, no. 12, pp. 8704–8716, May 2019.
- P. Veličković, G. Cucurull, A. Casanova, A. Romero, P. Lio, and Y. Bengio, “Graph attention networks,” Proc. Int. Conf. Learn. Represent., 2017.
- J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv:1707.06347, 2017.
- J. Schulman, P. Moritz, S. Levine, M. Jordan, and P. Abbeel, “High-dimensional continuous control using generalized advantage estimation,” arXiv:1506.02438, 2015.
- J. Hurink, B. Jurisch, and M. Thole, “Tabu search for the job-shop scheduling problem with multi-purpose machines,” Oper. Res. Spektrum, vol. 15, pp. 205–215, Dec. 1994.
- D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv:1412.6980, 2014.
- V. Sels, N. Gheysen, and M. Vanhoucke, “A comparison of priority rules for the job shop scheduling problem under different flow time-and tardiness-related objective functions,” Int. J. Prod. Res., vol. 50, no. 15, pp. 4255–4270, Aug. 2012.
- D. Behnke and M. J. Geiger, “Test instances for the flexible job shop scheduling problem with work centers,” Arbeitspapier/Research Paper/Helmut-Schmidt-Universität, Lehrstuhl für Betriebswirtschaftslehre, insbes. Logistik-Management, 2012.