Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode (2403.00318v1)
Abstract: We introduce a deep reinforcement learning (DRL) approach for solving management problems including inventory management, dynamic pricing, and recommendation. This DRL approach has the potential to lead to a large management model based on certain transformer neural network structures, resulting in an artificial general intelligence paradigm for various management tasks. Traditional methods have limitations for solving complex real-world problems, and we demonstrate how DRL can surpass existing heuristic approaches for solving management tasks. We aim to solve the problems in a unified framework, considering the interconnections between different tasks. Central to our methodology is the development of a foundational decision model coordinating decisions across the different domains through generative decision-making. Our experimental results affirm the effectiveness of our DRL-based framework in complex and dynamic business environments. This work opens new pathways for the application of DRL in management problems, highlighting its potential to revolutionize traditional business management.
- Optimal policies for a multi-echelon inventory problem. Management science, 6(4):475–490, 1960.
- Dynamic inventory–pricing control under backorder: Demand estimation and policy optimization. Manufacturing & Service Operations Management, 16(1):149–160, 2014.
- Optimization of recommender systems based on inventory. Production and Operations Management, 25(4):593–608, 2016.
- Xiaotian Liu and Yijie Alexopoulos, Christos Peng. Deep reinforcement learning for large-scale inventory management. Available at SSRN 4490327, 2023.
- Dynamic pricing in the presence of inventory considerations: Research overview, current practices, and future directions. Management science, 49(10):1287–1309, 2003.
- Integrating dynamic pricing with inventory decisions under lost sales. Management Science, 66(5):2232–2247, 2020.
- Joint dynamic pricing and order fulfillment for e-commerce retailers. Manufacturing & Service Operations Management, 20(2):269–284, 2018.
- Decision transformer: Reinforcement learning via sequence modeling. Advances in neural information processing systems, 34:15084–15097, 2021.
- Offline reinforcement learning as one big sequence modeling problem. Advances in neural information processing systems, 34:1273–1286, 2021.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258, 2021.
- A practical end-to-end inventory management model with deep learning. Management Science, 69(2):759–773, 2023.
- Combined pricing and inventory control under uncertainty. Operations research, 47(3):454–475, 1999.
- Data-driven dynamic pricing and inventory management of an omni-channel retailer in an uncertain demand environment. Available at SSRN 4395571, 2023.
- Recommender systems: Techniques, applications, and challenges. Recommender Systems Handbook, pages 1–35, 2021.
- Dilemma of data sharing alliance: when do competing personalizing and non-personalizing firms share data. Production and Operations Management, 29(8):1918–1936, 2020.
- Christopher S Tang. A review of marketing–operations interface models: From co-existence to coordination and collaboration. International Journal of Production Economics, 125(1):22–40, 2010.
- Proximal policy optimization algorithms. 2017.
- Asynchronous methods for deep reinforcement learning. Proc. Int. Conf. Mach. Learn., 48:1928–1937, 20–22 Jun 2016.
- Can deep reinforcement learning improve inventory management? Performance on lost sales, dual-sourcing, and multi-echelon problems. Manuf. Serv. Oper. Manag., 24(3), 2021.
- Fundamentals of supply chain theory. John Wiley & Sons, 2019.
- A deep Q-network for the beer game: Deep reinforcement learning for inventory optimization. Manuf. Serv. Oper. Manag., 24(1):285–304, 2022.
- Multi-agent deep reinforcement learning for multi-echelon inventory management. 2023.
- Dynamic pricing under competition with data-driven price anticipations and endogenous reference price effects. Journal of Revenue and Pricing Management, 18:451–464, 2019.
- Gunnar T Thowsen. A dynamic, nonstationary inventory problem for a price/quantity setting firm. Naval Research Logistics Quarterly, 22(3):461–476, 1975.
- Coordinating inventory control and pricing strategies with random demand and fixed ordering cost: The finite horizon case. Operations research, 52(6):887–896, 2004.
- A simple heuristic for joint inventory and pricing models with lead time and backorders. Management Science, 62(8):2358–2373, 2016.
- Marketing models, volume 803. Prentice-Hall Englewood Cliffs, NJ, 1992.