RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging (2403.03359v2)
Abstract: Autonomous parallel-style on-ramp merging in human controlled traffic continues to be an existing issue for autonomous vehicle control. Existing non-learning based solutions for vehicle control rely on rules and optimization primarily. These methods have been seen to present significant challenges. Recent advancements in Deep Reinforcement Learning have shown promise and have received significant academic interest however the available learning based approaches show inadequate attention to other highway vehicles and often rely on inaccurate road traffic assumptions. In addition, the parallel-style case is rarely considered. A novel learning based model for acceleration and lane change decision making that explicitly considers the utility to both the ego vehicle and its surrounding vehicles which may be cooperative or uncooperative to produce behaviour that is socially acceptable is proposed. The novel reward function makes use of Social Value Orientation to weight the vehicle's level of social cooperation and is divided into ego vehicle and surrounding vehicle utility which are weighted according to the model's designated Social Value Orientation. A two-lane highway with an on-ramp divided into a taper-style and parallel-style section is considered. Simulation results indicated the importance of considering surrounding vehicles in reward function design and show that the proposed model matches or surpasses those in literature in terms of collisions while also introducing socially courteous behaviour avoiding near misses and anti-social behaviour through direct consideration of the effect of merging on surrounding vehicles.
- S. Abuelsamid. (2023, 8) Waymo expands driverless robotaxi service to most of metro phoenix. [Online]. Available: https://www.forbes.com/sites/samabuelsamid/2023/05/04/waymo-expands-driverless-robotaxi-service-most-of-metro-phoenix/?sh=44b800f37bef
- A. Roy and A. Sriram. (2023, 8) Zoox headcount grows as amazon’s self-driving unit expands testing in vegas. [Online]. Available: https://www.reuters.com/business/autos-transportation/zoox-headcount-grows-amazons-self-driving-unit-expands-testing-vegas-2023-06-27/
- T. Litman, “Autonomous vehicle implementation predictions,” Victoria Transport Policy Institute Victoria, Tech. Rep., 2017.
- H. Wang, S. Yuan, M. Guo, X. Li, and W. Lan, “A deep reinforcement learning-based approach for autonomous driving in highway on-ramp merge,” Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, vol. 235, pp. 2726–2739, 2 2021. [Online]. Available: https://doi.org/10.1177/0954407021999480
- J. Zhu, S. Easa, and K. Gao, “Merging control strategies of connected and autonomous vehicles at freeway on-ramps: a comprehensive review,” Journal of Intelligent and Connected Vehicles, vol. 5, pp. 99–111, 1 2022.
- J. Rios-Torres and A. A. Malikopoulos, “A survey on the coordination of connected and automated vehicles at intersections and merging at highway on-ramps,” IEEE Transactions on Intelligent Transportation Systems, vol. 18, no. 5, pp. 1066–1077, 2017.
- Q. Liu, F. Dang, X. Wang, and X. Ren, “Autonomous highway merging in mixed traffic using reinforcement learning and motion predictive safety controller,” in 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), 2022, pp. 1063–1069.
- J. Ding, L. Li, H. Peng, and Y. Zhang, “A rule-based cooperative merging strategy for connected and automated vehicles,” IEEE Transactions on Intelligent Transportation Systems, vol. 21, pp. 3436–3446, 7 2020.
- K. Muhammad, A. Ullah, J. Lloret, J. D. Ser, and V. H. C. de Albuquerque, “Deep learning for safe autonomous driving: Current challenges and future directions,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, pp. 4316–4336, 12 2021.
- S. Triest, A. Villaflor, and J. M. Dolan, “Learning highway ramp merging via reinforcement learning with temporally-extended actions,” in 2020 IEEE Intelligent Vehicles Symposium (IV), Las Vegas, USA, 10 2020, pp. 1595–1600.
- Y. Lin, J. McPhee, and N. L. Azad, “Anti-jerk on-ramp merging using deep reinforcement learning,” in 2020 IEEE Intelligent Vehicles Symposium (IV). Las Vegas, USA: IEEE, 10 2020, pp. 7–14.
- X. Nie, Y. Liang, and K. Ohkura, “Autonomous highway driving using reinforcement learning with safety check system based on time-to-collision,” Artif Life Robotics, vol. 28, no. 1, pp. 158–165, 2023.
- D. W. Griesinger and J. W. Livingston Jr., “Toward a model of interpersonal motivation in experimental games,” Behavioral Science, vol. 18, no. 3, pp. 173–188, 1973. [Online]. Available: https://onlinelibrary.wiley.com/doi/abs/10.1002/bs.3830180305
- P. Kachroo and Z. Li, “Vehicle merging control design for an automated highway system,” in Proceedings of Conference on Intelligent Transportation Systems, Boston, USA, 11 1997, pp. 224–229.
- C. Baker and J. Dolan, “Traffic interaction in the urban challenge: Putting boss on its best behavior,” in 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems. Nice, France: IEEE, 9 2008, pp. 1752–1758.
- V. Milanes, J. Godoy, J. Villagra, and J. Perez, “Automated on-ramp merging system for congested traffic situations,” IEEE Transactions on Intelligent Transportation Systems, vol. 12, no. 2, pp. 500–508, 2011.
- C. Yang and K. Kurami, “Longitudinal guidance and control for the entry of vehicles onto automated highways,” in Proceedings of 32nd IEEE Conference on Decision and Control, 1993, pp. 1891–1896 vol.2.
- K. Liu, N. Li, H. E. Tseng, I. Kolmanovsky, and A. Girard, “Interaction-aware trajectory prediction and planning for autonomous vehicles in forced merge scenarios,” IEEE Transactions on Intelligent Transportation Systems, vol. 24, no. 1, pp. 474–488, 2023.
- H. Okuda, T. Suzuki, K. Harada, S. Saigo, and S. Inoue, “Quantitative driver acceptance modeling for merging car at highway junction and its application to the design of merging behavior control,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 1, pp. 329–340, 2021.
- S. Wu, D. Tian, J. Zhou, X. Duan, Z. Sheng, and D. Zhao, “Autonomous on-ramp merge strategy using deep reinforcement learning in uncertain highway environment,” in 2022 IEEE International Conference on Unmanned Systems (ICUS), Guangzhou, China, 10 2022, pp. 658–663.
- J. Lubars, H. Gupta, S. Chinchali, L. Li, A. Raja, R. Srikant, and X. Wu, “Combining reinforcement learning with model predictive control for on-ramp merging,” in 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, USA, 9 2021, pp. 942–947.
- L. Crosato, C. Wei, E. S. L. Ho, and H. P. H. Shum, “Human-centric autonomous driving in an av-pedestrian interactive environment using svo,” in 2021 IEEE 2nd International Conference on Human-Machine Systems (ICHMS), 2021, pp. 1–6.
- L. Crosato, H. P. H. Shum, E. S. L. Ho, and C. Wei, “Interaction-aware decision-making for automated vehicles using social value orientation,” IEEE Transactions on Intelligent Vehicles, vol. 8, no. 2, pp. 1339–1349, 2023.
- B. Toghi, R. Valiente, D. Sadigh, R. Pedarsani, and Y. P. Fallah, “Cooperative autonomous vehicles that sympathize with human drivers,” in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 4517–4524.
- W. Schwarting, A. Pierson, J. Alonso-Mora, S. Karaman, and D. Rus, “Social behavior for autonomous vehicles,” Proceedings of the National Academy of Sciences, vol. 116, no. 50, pp. 24 972–24 978, 2019. [Online]. Available: https://www.pnas.org/doi/abs/10.1073/pnas.1820676116
- J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 7 2017.
- M. Treiber, A. Hennecke, and D. Helbing, “Congested traffic states in empirical observations and microscopic simulations,” Phys. Rev. E, vol. 62, pp. 1805–1824, 8 2000. [Online]. Available: https://link.aps.org/doi/10.1103/PhysRevE.62.1805
- G. Brockman, V. Cheung, L. Pettersson, J. Schneider, J. Schulman, J. Tang, and W. Zaremba, “Openai gym,” arXiv preprint arXiv:1606.01540, 2016.
- P. A. Lopez, M. Behrisch, L. Bieker-Walz, J. Erdmann, Y.-P. Flötteröd, R. Hilbrich, L. Lücken, J. Rummel, P. Wagner, and E. Wießner, “Microscopic traffic simulation using sumo,” in The 21st IEEE International Conference on Intelligent Transportation Systems. IEEE, 2018. [Online]. Available: https://elib.dlr.de/124092/
- A. Raffin, A. Hill, A. Gleave, A. Kanervisto, M. Ernestus, and N. Dormann, “Stable-baselines3: Reliable reinforcement learning implementations,” Journal of Machine Learning Research, vol. 22, no. 268, pp. 1–8, 2021. [Online]. Available: http://jmlr.org/papers/v22/20-1364.html
- A. Dosovitskiy, G. Ros, F. Codevilla, A. Lopez, and V. Koltun, “CARLA: An open urban driving simulator,” in Proceedings of the 1st Annual Conference on Robot Learning, ser. Proceedings of Machine Learning Research, S. Levine, V. Vanhoucke, and K. Goldberg, Eds., vol. 78. Aukland, NZ: PMLR, 13–15 Nov 2017, pp. 1–16.
- Jordan Poots (1 paper)