Fourier neural operator for learning solutions to macroscopic traffic flow models: Application to the forward and inverse problems (2308.07051v2)
Abstract: Deep learning methods are emerging as popular computational tools for solving forward and inverse problems in traffic flow. In this paper, we study a neural operator framework for learning solutions to nonlinear hyperbolic partial differential equations with applications in macroscopic traffic flow models. In this framework, an operator is trained to map heterogeneous and sparse traffic input data to the complete macroscopic traffic state in a supervised learning setting. We chose a physics-informed Fourier neural operator ($\pi$-FNO) as the operator, where an additional physics loss based on a discrete conservation law regularizes the problem during training to improve the shock predictions. We also propose to use training data generated from random piecewise constant input data to systematically capture the shock and rarefied solutions. From experiments using the LWR traffic flow model, we found superior accuracy in predicting the density dynamics of a ring-road network and urban signalized road. We also found that the operator can be trained using simple traffic density dynamics, e.g., consisting of $2-3$ vehicle queues and $1-2$ traffic signal cycles, and it can predict density dynamics for heterogeneous vehicle queue distributions and multiple traffic signal cycles $(\geq 2)$ with an acceptable error. The extrapolation error grew sub-linearly with input complexity for a proper choice of the model architecture and training data. Adding a physics regularizer aided in learning long-term traffic density dynamics, especially for problems with periodic boundary data.
- Learning-based solutions to nonlinear hyperbolic PDEs: Empirical insights on generalization errors, in: Machine Learning and the Physical Sciences Workshop, NeurIPS 2022, 2022. URL: https://ml4physicalsciences.github.io/2022/files/NeurIPS_ML4PS_2022_106.pdf.
- A generative car-following model conditioned on driving styles, Transportation Research Part C: Emerging Technologies 145 (2022) 103926. URL: https://www.sciencedirect.com/science/article/pii/S0968090X22003394. doi:https://doi.org/10.1016/j.trc.2022.103926.
- A physics-informed deep learning paradigm for car-following models, Transportation Research Part C: Emerging Technologies 130 (2021) 103240. URL: https://www.sciencedirect.com/science/article/pii/S0968090X21002539. doi:https://doi.org/10.1016/j.trc.2021.103240.
- Macroscopic traffic flow modeling with physics regularized gaussian process: A new insight into machine learning applications in transportation, Transportation Research Part B: Methodological 146 (2021) 88–110. URL: https://www.sciencedirect.com/science/article/pii/S0191261521000369. doi:https://doi.org/10.1016/j.trb.2021.02.007.
- Generalized adaptive smoothing based neural network architecture for traffic state estimation, in: Proceedings of the 22nd IFAC World Congress, volume 56, 2023, pp. 3483–3490. doi:10.1016/j.ifacol.2023.10.1502, to appear.
- Incorporating kinematic wave theory into a deep learning method for high-resolution traffic speed estimation, IEEE Transactions on Intelligent Transportation Systems 23 (2022) 17849–17862. doi:10.1109/TITS.2022.3157439.
- Learning traffic speed dynamics from visualizations, in: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC 2021), 2021, pp. 1239–1244. doi:10.1109/ITSC48978.2021.9564541.
- A physics-informed deep learning paradigm for traffic state and fundamental diagram estimation, IEEE Transactions on Intelligent Transportation Systems (2021) 1–11. doi:10.1109/TITS.2021.3106259.
- A. J. Huang, S. Agarwal, Physics informed deep learning for traffic state estimation, in: 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), 2020, pp. 1–6. doi:10.1109/ITSC45102.2020.9294236.
- Learning traffic flow dynamics using random fields, IEEE Access 7 (2019) 130566–130577.
- Network-scale traffic prediction via knowledge transfer and regional mfd analysis, Transportation Research Part C: Emerging Technologies 141 (2022) 103719. URL: https://www.sciencedirect.com/science/article/pii/S0968090X22001565. doi:https://doi.org/10.1016/j.trc.2022.103719.
- Short-term traffic prediction using physics-aware neural networks, Transportation Research Part C: Emerging Technologies 142 (2022) 103772. URL: https://www.sciencedirect.com/science/article/pii/S0968090X22002030. doi:https://doi.org/10.1016/j.trc.2022.103772.
- Learning-based traffic state reconstruction using probe vehicles, IFAC-PapersOnLine 54 (2021) 87–92. URL: https://www.sciencedirect.com/science/article/pii/S2405896321004420. doi:https://doi.org/10.1016/j.ifacol.2021.06.013, 16th IFAC Symposium on Control in Transportation Systems CTS 2021.
- A physics-informed reinforcement learning-based strategy for local and coordinated ramp metering, Transportation Research Part C: Emerging Technologies 137 (2022) 103584. URL: https://www.sciencedirect.com/science/article/pii/S0968090X22000304. doi:https://doi.org/10.1016/j.trc.2022.103584.
- X. Di, R. Shi, A survey on autonomous vehicle control in the era of mixed-autonomy: From physics-based to ai-guided driving policy learning, Transportation Research Part C: Emerging Technologies 125 (2021) 103008. URL: https://www.sciencedirect.com/science/article/pii/S0968090X21000401. doi:https://doi.org/10.1016/j.trc.2021.103008.
- Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, Journal of Computational Physics 378 (2019) 686–707. URL: https://www.sciencedirect.com/science/article/pii/S0021999118307125. doi:https://doi.org/10.1016/j.jcp.2018.10.045.
- Learning nonlinear operators via deeponet based on the universal approximation theorem of operators, Nature Machine Intelligence 3 (2021) 218–229.
- Fourier neural operator for parametric partial differential equations, in: International Conference on Learning Representations, 2021. URL: https://openreview.net/forum?id=c8P9NQVtmnO.
- Understanding and mitigating gradient flow pathologies in physics-informed neural networks, SIAM Journal on Scientific Computing 43 (2021) A3055–A3081. URL: https://doi.org/10.1137/20M1318043. doi:10.1137/20M1318043.
- M. Lighthill, G. Whitham, On kinematic waves. II. A theory of traffic flow on long crowded roads, in: Royal Society of London. Series A, Mathematical and Physical Sciences, volume 229, 1955, pp. 317–345.
- P. I. Richards, Shock Waves on the Highway, Operations Research 4 (1956) 42–51.
- Analysis of interrupted traffic flow by finite difference methods, Transportation Research Part B: Methodological 18 (1984) 409–421. URL: https://www.sciencedirect.com/science/article/pii/0191261584900213. doi:https://doi.org/10.1016/0191-2615(84)90021-3.
- D. E. Beskos, P. G. Michalopoulos, An application of the finite element method in traffic signal analysis, Mechanics Research Communications 11 (1984) 185–189. doi:10.1016/0093-6413(84)90061-2.
- C. F. Daganzo, The cell transmission model: A dynamic representation of highway traffic consistent with the hydrodynamic theory, Transportation Research Part B: Methodological 28 (1994) 269–287.
- C. F. Daganzo, The cell transmission model, part ii: Network traffic, Transportation Research Part B: Methodological 29 (1995) 79–93. URL: https://www.sciencedirect.com/science/article/pii/019126159400022R. doi:https://doi.org/10.1016/0191-2615(94)00022-R.
- J. P. Lebacque, The godunov scheme and what it means for first order traffic flow models, in: Internaional Symposium on Transportation and Traffic Theory, 1996, pp. 647–677.
- C. F. Daganzo, A variational formulation of kinematic waves: Solution methods, Transportation Research Part B: Methodological 39 (2005) 934–950. URL: https://linkinghub.elsevier.com/retrieve/pii/S0191261505000068. doi:10.1016/j.trb.2004.05.003.
- C. G. Claudel, A. M. Bayen, Lax–hopf based incorporation of internal boundary conditions into hamilton–jacobi equation. part i: Theory, IEEE Transactions on Automatic Control 55 (2010) 1142–1157. doi:10.1109/TAC.2010.2041976.
- Analytical and grid-free solutions to the Lighthill–Whitham–Richards traffic flow model, Transportation Research Part B: Methodological 45 (2011) 1727–1748. URL: https://linkinghub.elsevier.com/retrieve/pii/S0191261511001044. doi:10.1016/j.trb.2011.07.004.
- The lagrangian coordinates and what it means for first order traffic flow models, in: Transportation and traffic theory 2007: papers selected for presentation at ISTTT17, 2007.
- On sequential data assimilation for scalar macroscopic traffic flow models, Physica D: Nonlinear Phenomena 241 (2012) 1421–1440. URL: https://www.sciencedirect.com/science/article/pii/S0167278912001339. doi:https://doi.org/10.1016/j.physd.2012.05.005.
- Real-time lagrangian traffic state estimator for freeways, IEEE Transactions on Intelligent Transportation Systems 13 (2012) 59–70. doi:10.1109/TITS.2011.2178837.
- S. E. Jabari, H. Liu, A stochastic model of traffic flow: Gaussian approximation and estimation, Transportation Research Part B: Methodological 47 (2013) 15–41.
- E. S. Canepa, C. G. Claudel, Networked traffic state estimation involving mixed fixed-mobile sensor data using hamilton-jacobi equations, Transportation Research Part B: Methodological 104 (2017) 686–709. URL: https://www.sciencedirect.com/science/article/pii/S0191261516302983. doi:https://doi.org/10.1016/j.trb.2017.05.016.
- Traffic state estimation using stochastic Lagrangian dynamics, Transportation Research Part B: Methodological 115 (2018) 143–165.
- Estimating motorway traffic states with data fusion and physics-informed deep learning, in: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 2208–2214. doi:10.1109/ITSC48978.2021.9565096.
- A. J. Huang, S. Agarwal, Physics-informed deep learning for traffic state estimation: Illustrations with LWR and CTM models, IEEE Open Journal of Intelligent Transportation Systems 3 (2022) 503–518. doi:10.1109/OJITS.2022.3182925.
- Thermodynamically consistent physics-informed neural networks for hyperbolic systems, Journal of Computational Physics 449 (2022) 110754. URL: https://www.sciencedirect.com/science/article/pii/S0021999121006495. doi:https://doi.org/10.1016/j.jcp.2021.110754.
- Weak physics informed neural networks for approximating entropy solutions of hyperbolic conservation laws, SAM Research Report 2022 (2022).
- Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems, Computer Methods in Applied Mechanics and Engineering 365 (2020) 113028. URL: https://www.sciencedirect.com/science/article/pii/S0045782520302127. doi:https://doi.org/10.1016/j.cma.2020.113028.
- D. Yarotsky, Error bounds for approximations with deep relu networks, Neural Networks 94 (2017) 103–114. URL: https://www.sciencedirect.com/science/article/pii/S0893608017301545. doi:https://doi.org/10.1016/j.neunet.2017.07.002.
- Distributed learning machines for solving forward and inverse problems in partial differential equations, Neurocomputing 420 (2021) 299–316. URL: https://www.sciencedirect.com/science/article/pii/S0925231220314090. doi:https://doi.org/10.1016/j.neucom.2020.09.006.
- A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks, Computer Methods in Applied Mechanics and Engineering 403 (2023) 115671. URL: https://www.sciencedirect.com/science/article/pii/S0045782522006260. doi:https://doi.org/10.1016/j.cma.2022.115671.
- Physics-informed neural networks for high-speed flows, Computer Methods in Applied Mechanics and Engineering 360 (2020) 112789. URL: https://www.sciencedirect.com/science/article/pii/S0045782519306814. doi:https://doi.org/10.1016/j.cma.2019.112789.
- On universal approximation and error bounds for fourier neural operators, Journal of Machine Learning Research 22 (2021) Art–No.
- Learning the solution operator of parametric partial differential equations with physics-informed deeponets, Science Advances 7 (2021) eabi8605. URL: https://www.science.org/doi/abs/10.1126/sciadv.abi8605. doi:10.1126/sciadv.abi8605.
- S. E. Jabari, Node modeling for congested urban road networks, Transportation Research Part B: Methodological 91 (2016) 229–249. URL: https://www.sciencedirect.com/science/article/pii/S0191261516303381. doi:https://doi.org/10.1016/j.trb.2016.06.001.
- B. D. Greenshields, A study of traffic capacity, in: Proceedings of the highway research board, volume 14, 1935, pp. 448–477.
- Multipole graph neural operator for parametric partial differential equations, Advances in Neural Information Processing Systems 33 (2020) 6755–6766.
- T. Tripura, S. Chakraborty, Wavelet neural operator for solving parametric partial differential equations in computational mechanics problems, Computer Methods in Applied Mechanics and Engineering 404 (2023) 115783. URL: https://www.sciencedirect.com/science/article/pii/S0045782522007393. doi:https://doi.org/10.1016/j.cma.2022.115783.
- T. D. Ryck, S. Mishra, Generic bounds on the approximation error for physics-informed (and) operator learning, in: A. H. Oh, A. Agarwal, D. Belgrave, K. Cho (Eds.), Advances in Neural Information Processing Systems, 2022. URL: https://openreview.net/forum?id=bF4eYy3LTR9.
- Nonlinear traffic prediction as a matrix completion problem with ensemble learning, Transportation Science 56 (2022) 52–78. doi:10.1287/trsc.2021.1086.
- Generalized adaptive smoothing using matrix completion for traffic state estimation, in: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), 2022, pp. 787–792. doi:10.1109/ITSC55140.2022.9921908.
- E. Barmpounakis, N. Geroliminis, On the new era of urban traffic monitoring with massive drone data: The pNEUMA large-scale field experiment, Transportation Research Part C: Emerging Technologies 111 (2020) 50–71. URL: https://www.sciencedirect.com/science/article/pii/S0968090X19310320. doi:https://doi.org/10.1016/j.trc.2019.11.023.
- Lane detection and lane-changing identification with high-resolution data from a swarm of drones, Transportation Research Record 2674 (2020) 1–15. URL: https://doi.org/10.1177/0361198120920627. doi:10.1177/0361198120920627.
- J. J. Yu, Online traffic speed estimation for urban road networks with few data: A transfer learning approach, in: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), 2019, pp. 4024–4029. doi:10.1109/ITSC.2019.8917502.
- S. Ben-David, R. Schuller, Exploiting task relatedness for multiple task learning, in: B. Schölkopf, M. K. Warmuth (Eds.), Learning Theory and Kernel Machines, Springer Berlin Heidelberg, Berlin, Heidelberg, 2003, pp. 567–580.