Drift Control of High-Dimensional RBM: A Computational Method Based on Neural Networks (2309.11651v4)
Abstract: Motivated by applications in queueing theory, we consider a stochastic control problem whose state space is the $d$-dimensional positive orthant. The controlled process $Z$ evolves as a reflected Brownian motion whose covariance matrix is exogenously specified, as are its directions of reflection from the orthant's boundary surfaces. A system manager chooses a drift vector $\theta(t)$ at each time $t$ based on the history of $Z$, and the cost rate at time $t$ depends on both $Z(t)$ and $\theta(t)$. In our initial problem formulation, the objective is to minimize expected discounted cost over an infinite planning horizon, after which we treat the corresponding ergodic control problem. Extending earlier work by Han et al. (Proceedings of the National Academy of Sciences, 2018, 8505-8510), we develop and illustrate a simulation-based computational method that relies heavily on deep neural network technology. For test problems studied thus far, our method is accurate to within a fraction of one percent, and is computationally feasible in dimensions up to at least $d=30$.
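The setup described in the abstract can be illustrated with a rough simulation sketch. This is not the paper's neural-network method: it only estimates, by plain Monte Carlo, the expected discounted cost of one fixed drift policy. It rests on several simplifying assumptions that are not taken from the paper: normal reflection (each coordinate is clipped at zero, which corresponds to an identity reflection matrix), drift entering the dynamics with a plus sign, an Euler time step, and a finite truncation of the infinite horizon. The function name, the example policy, and the example cost function are all hypothetical.

```python
import numpy as np


def simulate_discounted_cost(z0, policy, cost_rate, cov, discount=0.1,
                             horizon=10.0, dt=0.01, n_paths=200, seed=0):
    """Monte Carlo estimate of expected discounted cost for a drift-controlled
    reflected Brownian motion in the positive orthant.

    Assumptions (illustrative only): normal reflection via clipping at zero,
    Euler discretization, and truncation of the horizon at `horizon`.
    """
    rng = np.random.default_rng(seed)
    d = len(z0)
    chol = np.linalg.cholesky(cov)            # cov = chol @ chol.T
    z = np.tile(np.asarray(z0, dtype=float), (n_paths, 1))
    total = np.zeros(n_paths)
    n_steps = int(round(horizon / dt))
    for k in range(n_steps):
        theta = policy(z)                     # drift chosen from current state
        # Left-endpoint Riemann sum of the discounted cost rate.
        total += np.exp(-discount * k * dt) * cost_rate(z, theta) * dt
        # Correlated Brownian increments with covariance cov * dt.
        noise = rng.standard_normal((n_paths, d)) @ chol.T * np.sqrt(dt)
        # Euler step followed by projection onto the orthant (normal reflection).
        z = np.maximum(z + theta * dt + noise, 0.0)
    return total.mean(), z


# Hypothetical example: d = 3, constant drift toward the origin,
# linear holding cost plus quadratic control cost.
d = 3
cov = np.eye(d) + 0.2 * (np.ones((d, d)) - np.eye(d))


def constant_policy(z):
    return -np.ones_like(z)


def example_cost(z, theta):
    return z.sum(axis=1) + 0.5 * (theta ** 2).sum(axis=1)


estimate, final_states = simulate_discounted_cost(
    np.ones(d), constant_policy, example_cost, cov)
print(f"estimated discounted cost: {estimate:.3f}")
```

Replacing `constant_policy` with a state-dependent function gives a crude way to compare candidate policies, though the clipping scheme handles only normal reflection and would need a proper Skorokhod-map step for general reflection directions.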
References
- TensorFlow: A system for large-scale machine learning. In OSDI, volume 16, pages 265–283. Savannah, GA, USA, 2016.
- Barış Ata. Dynamic control of a multiclass queue with thin arrival streams. Operations Research, 54(5):876–892, 2006.
- An approximate analysis of dynamic pricing, outsourcing, and scheduling policies for a multiclass make-to-stock queue in the heavy traffic regime. Operations Research, 71(1):341–357, 2023.
- Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems. 2023.
- Drift rate control of a Brownian processing system. Annals of Applied Probability, 15(2):1145–1160, 2005.
- Dynamic volunteer staffing in multicrop gleaning operations. Operations Research, 67(2):295–314, 2019.
- Drift control of international reserves. Journal of Economic Dynamics & Control, 31:3110–3137, 2007.
- An overview on deep learning-based approximation methods for partial differential equations. Discrete and Continuous Dynamical Systems - Series B, 28(6):3697–3746, 2023.
- Patrick Billingsley. Convergence of probability measures (2nd edition). John Wiley & Sons, 1999.
- Efficient steady-state simulation of high-dimensional stochastic networks. Stochastic Systems, 11(2):174–192, 2021.
- Long time asymptotics for controlled diffusions in polyhedral domains. Stochastic Processes and Their Applications, 117(8):1014–1036, 2007.
- Dynamic pricing and lead-time quotation for a multiclass make-to-order queue. Management Science, 54(6):1132–1146, 2008.
- Existence and uniqueness of semimartingale reflecting Brownian motions in convex polyhedrons. Theory of Probability & Its Applications, 40(1):1–40, 1996.
- Algorithms for solving high dimensional PDEs: from nonlinear Monte Carlo to machine learning. Nonlinearity, 35:278–310, 2022.
- Optimal buffer size for a stochastic processing network in heavy traffic. Queueing Systems, 55(3):147–159, 2007.
- Optimal buffer size and dynamic rate control for a queueing system with impatient customers in heavy traffic. Stochastic Processes and Their Applications, 120(11):2103–2141, 2010.
- Solving high-dimensional partial differential equations using deep learning. Proceedings of the National Academy of Sciences, 115(34):8505–8510, 2018.
- J Michael Harrison. Brownian models of queueing networks with heterogeneous customer populations. In Stochastic differential systems, stochastic control theory and applications, pages 147–186. Springer, 1988.
- J Michael Harrison. Brownian models of open processing networks: Canonical representation of workload. The Annals of Applied Probability, 10(1):75–103, 2000.
- Brownian models of multiclass queueing networks: Current status and open problems. Queueing Systems, 13:5–40, 1993.
- Reflected Brownian motion on an orthant. The Annals of Probability, 9(2):302–308, 1981.
- Scheduling networks of queues: heavy traffic analysis of a simple open network. Queueing Systems, 5:265–279, 1989.
- Scheduling networks of queues: Heavy traffic analysis of a two-station closed network. Operations Research, 38(6):1052–1064, 1990.
- Brownian models of open queueing networks with homogeneous customer populations. Stochastics: An International Journal of Probability and Stochastic Processes, 22(2):77–115, 1987.
- Sepp Hochreiter. The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(02):107–116, 1998.
- Multiple channel queues in heavy traffic. I. Advances in Applied Probability, 2(1):150–177, 1970a.
- Multiple channel queues in heavy traffic. II: Sequences, networks, and batches. Advances in Applied Probability, 2(2):355–369, 1970b.
- Ioannis Karatzas. A class of singular control problems. Advances in Applied Probability, 15(2):225–254, 1983.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Diffusion approximation for GI/G/1 controlled queues. Queueing Systems, 12:333–367, 1992.
- Numerical methods for stochastic singular control problems. SIAM Journal on Control and Optimization, 29(6):1443–1475, 1991.
- Harold Joseph Kushner. Heavy traffic analysis of controlled queueing and communication networks, volume 28. Springer, 2001.
- Heavy traffic convergence of a controlled, multiclass queueing system. SIAM Journal on Control and Optimization, 34(6):2133–2171, 1996.
- Routing and singular control for queueing networks in heavy traffic. SIAM Journal on Control and Optimization, 28(5):1209–1233, 1990.
- Bernt Oksendal. Stochastic differential equations: an introduction with applications (sixth edition). Springer Science & Business Media, 2003.
- Drift control with changeover costs. Operations Research, 59(2):427–439, 2011.
- William P Peterson. A heavy traffic limit theorem for networks of queues with multiple customer types. Mathematics of Operations Research, 16(1):90–118, 1991.
- A review of activation function for artificial neural network. In 2020 IEEE 18th World Symposium on Applied Machine Intelligence and Informatics (SAMI), pages 281–286. IEEE, 2020.
- Martin I Reiman. Open queueing networks in heavy traffic. Mathematics of Operations Research, 9(3):441–458, 1984.
- Dynamic control of a make-to-order, parallel-server system with cancellations. Operations Research, 57(1):94–108, 2009.
- Existence and uniqueness of semimartingale reflecting Brownian motions in an orthant. Probability Theory and Related Fields, 96(3):283–317, 1993.
- John H. Vande Vate. Average cost Brownian drift control with proportional changeover costs. Stochastic Systems, 11(3):218–263, 2021.
- Lawrence M Wein. Brownian networks with discretionary routing. Operations Research, 39(2):322–340, 1991.
- Ruth J Williams. On the approximation of queueing networks in heavy traffic. Stochastic Networks: Theory and Applications, 4:35–56, 1996.
- Ruth J Williams. Diffusion approximations for open multiclass queueing networks: sufficient conditions involving state space collapse. Queueing Systems, 30:27–88, 1998a.
- Ruth J Williams. An invariance principle for semimartingale reflecting Brownian motions in an orthant. Queueing Systems, 30:5–25, 1998b.
- Andreas Winkelbauer. Moments and absolute moments of the normal distribution. arXiv preprint arXiv:1209.4340, 2012.
- Wasserstein control of mirror Langevin Monte Carlo. In Conference on Learning Theory, pages 3814–3841. PMLR, 2020.
- Actor-critic method for high dimensional static Hamilton–Jacobi–Bellman partial differential equations based on neural networks. SIAM Journal on Scientific Computing, 43(6):A4043–A4066, 2021a.
- Code for “Actor-critic method for high dimensional static Hamilton–Jacobi–Bellman partial differential equations based on neural networks”. 2021b. URL https://github.com/MoZhou1995/DeepPDE_ActorCritic.