RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models (2405.00449v1)
Abstract: Prediction of road users' behaviors in the context of autonomous driving has gained considerable attention by the scientific community in the last years. Most works focus on predicting behaviors based on kinematic information alone, a simplification of the reality since road users are humans, and as such they are highly influenced by their surrounding context. In addition, a large plethora of research works rely on powerful Deep Learning techniques, which exhibit high performance metrics in prediction tasks but may lack the ability to fully understand and exploit the contextual semantic information contained in the road scene, not to mention their inability to provide explainable predictions that can be understood by humans. In this work, we propose an explainable road users' behavior prediction system that integrates the reasoning abilities of Knowledge Graphs (KG) and the expressiveness capabilities of LLMs (LLM) by using Retrieval Augmented Generation (RAG) techniques. For that purpose, Knowledge Graph Embeddings (KGE) and Bayesian inference are combined to allow the deployment of a fully inductive reasoning system that enables the issuing of predictions that rely on legacy information contained in the graph as well as on current evidence gathered in real time by onboard sensors. Two use cases have been implemented following the proposed approach: 1) Prediction of pedestrians' crossing actions; 2) Prediction of lane change maneuvers. In both cases, the performance attained surpasses the current state of the art in terms of anticipation and F1-score, showing a promising avenue for future research in this field.
- W. H. Organization, “Global status report on road safety,” 2023.
- F. Slootmans, “European road safety observatory. technical report. european comission,” 2022.
- T. Stewart, “Overview of motor vehicle traffic crashes,” 2021.
- J. K. T. Iuliia Kotseruba, Amir Rasouli, “Benchmark for evaluating pedestrian action prediction,” Technical report, 2021.
- D. M. G. E. A. I. Pool, J. F. P. Kooij, “Crafted vs. learned representations in predictive models: A case study on cyclist path prediction,” IEEE Transactions on Intelligent Vehicles, 2021.
- R. I. et al, “Vehicle trajectory prediction on highways using bird eye view representations and deep learning,” Applied Intelligence, pp. 1–19, 2022.
- R. S. T. Schulz, “A controlled interactive multiple model filter for combined pedestrian intention recognition and path prediction,” IEEE Intelligent Transportation Systems Conference, pp. 173–178, 2015.
- R. I. et al, “A. su, k. muelling, j. dolan, p. palanisamy, p. mudalige,” IEEE Intelligent Vehicles Symposium, pp. 1412–1417, 2018.
- A. Benterki, M. Boukhnifer, V. Judalet, and M. Choubeila, “Prediction of surrounding vehicles lane change intention using machine learning,” IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, pp. 839–843, 2019.
- R. Izquierdo, A. Quintanar, I. Parra, D. Fernández-Llorca, and M. A. Sotelo, “Experimental validation of lane-change intention prediction methodologies based on cnn and lstm,” IEEE Intelligent Transportation Systems Conference, pp. 3657–3662, 2019.
- R. Izquierdo, A. Quintanar, I. Parra-Alonso, D. F. Llorca, and M. A. Sotelo, “The prevention – a novel benchmark for prediction of vehicles intentions,” IEEE Intelligent Transportation Systems Conference, 2019.
- O. Laimona, M. A. Manzour, O. M. Shehata, and E. I. Morgan, “Implementation and evaluation of an enhanced intention prediction algorithm for lane-changing scenarios on highway roads,” 2nd Novel Intelligent and Leading Emerging Sciences Conference, pp. 128–133, 2020.
- J. L. Q. Xue, Y. Xing, “An integrated lane change prediction model incorporating traffic context based on trajectory data,” Transportation research part C: emerging technologies, vol. 141, 2022.
- R. K. et al, “Experimental insights towards explainable and interpretable pedestrian crossing prediction,” 2018.
- K. Gao, X. Li, B. Chen, L. Hu, J. Liu, R. Du, and Y. Li, “Dual transformer based prediction for lane change intentions and trajectories in mixed traffic environment,” IEEE Transactions on Intelligent Transportation Systems, 2023.
- J. K. T. Iuliia Kotseruba, Amir Rasouli, “Do they want to cross? understanding pedestrian intention for behavior prediction,” IEEE Intelligent Vehicles Symposium (IV), pp. 1688–1693, 2020.
- M. A. S. Javier Lorenzo, Ignacio Parra, “Capformer: Pedestrian crossing action prediction using transformer,” Sensors, 2021.
- D. T. et al, “Learning spatiotemporal features with 3d convolutional networks,” 2015.
- J. K. T. Amir Rasouli, Iuliia Kotseruba, “Pedestrian action anticipation using contextual feature fusion in stacked rnns,” 2020.
- X. et al, “Convolutional lstm network: A machine learning approach for precipitation nowcasting,” Advances in Neural Information Processing Systems, vol. 28, 2015.
- L. A. et al, “Analysis over vision-based models for pedestrian action anticipation,” 2023.
- N. M. et al, “Emidas: explainable social interaction-based pedestrian intention detection across street,” Annual ACM Symposium on Applied Computing, 2021.
- K. Yi, J. Wu, C. Gan, A. Torralba, P. Kohli, and J. B. Tenenbaum, “Neural-symbolic vqa: Disentangling reasoning from vision and language understanding,” 2019.
- A. H. et al, “Knowledge graphs,” 2020.
- A. Martin, K. Hinkelmann, H.-G. Fill, A. Gerber, D. Lenat, R. Stolle, and F. van Harmelen, “An evaluation of knowledge graph embeddings for autonomous driving data: Experience and practice,” Proceedings of the AAAI 2020 Spring Symposium on Combining Machine Learning and Knowledge Engineering in Practice, 2020.
- C. Peng, F. Xia, M. Naseriparsa, and F. Osborne, “Knowledge graphs: Opportunities and challenges,” 2023.
- S. Choudhary, T. Luthra, A. Mittal, and R. Singh, “A survey of knowledge graph embedding and their applications,” 2021.
- A. Bordes, N. Usunier, A. Garcia-Duran, J. Weston, and O. Yakhnenko, “Translating embeddings for modeling multi-relational data,” in Advances in Neural Information Processing Systems, C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Weinberger, Eds., vol. 26. Curran Associates, Inc., 2013. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2013/file/1cecc7a77928ca8133fa24680a88d2f9-Paper.pdf
- T. Trouillon, J. Welbl, S. Riedel, É. Gaussier, and G. Bouchard, “Complex embeddings for simple link prediction,” CoRR, vol. abs/1606.06357, 2016. [Online]. Available: http://arxiv.org/abs/1606.06357
- T. Chen, T. Jing, R. Tian, Y. Chen, J. Domeyer, H. Toyoda, R. Sherony, and Z. Ding, “Psi: A pedestrian behavior dataset for socially intelligent autonomous car,” arXiv preprint arXiv:2112.02604, 2021.
- A. N. Melo, L. F. Herrera-Quintero, C. Salinas, and M. A. Sotelo, “Knowledge-based explainable pedestrian behavior predictor,” 2024.
- R. Krajewski, J. Bock, L. Kloeker, and L. Eckstein, “The highd dataset: A drone dataset of naturalistic vehicle trajectories on german highways for validation of highly automated driving systems,” in 2018 21st International Conference on Intelligent Transportation Systems (ITSC), 2018, pp. 2118–2125.
- M. Manzour, A. Ballardini, R. Izquierdo, and M. Sotelo, “Vehicle lane change prediction based on knowledge graph embeddings and bayesian inference,” arXiv preprint arXiv:2312.06336, 2023.
- M. Saffarzadeh, N. Nadimi, S. Naseralavi, and A. R. Mamdoohi, “A general formulation for time-to-collision safety indicator,” in Proceedings of the Institution of Civil Engineers-Transport, vol. 166, no. 5. Thomas Telford Ltd, 2013, pp. 294–304.
- E. RAMEZANI-KHANSARI, F. M. NEJAD, and S. MOOGEH, “Comparing time to collision and time headway as safety criteria,” Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, vol. 27, no. 6, pp. 669–675, 2020.
- L. Costabello, A. Bernardi, A. Janik, S. Pai, C. L. Van, R. McGrath, N. McCarthy, and P. Tabacof, “AmpliGraph: a Library for Representation Learning on Knowledge Graphs,” Mar. 2019. [Online]. Available: https://doi.org/10.5281/zenodo.2595043
- J. Sanz, A. Fernandez, H. Bustince, and F. Herrera, “Ivturs: a linguistic fuzzy rule-based classification system based on a new interval-valued fuzzy reasoning method with tuning and rule selection,” IEEE Transactions on Fuzzy Systems, vol. 21, no. 3, pp. 399–411, 2013.
- J. Alcala-Fdez, R. Alcala, and F. Herrera, “A fuzzy association rule-based classification model for high-dimensional problems with genetic rule selection and lateral tuning,” IEEE Transactions on Fuzzy Systems, vol. 19, no. 5, pp. 857–872, 2011.
- H. Ishibuchi and T. Nakashima, “Effect of rule weights in fuzzy rule-based classification systems,” IEEE Transactions on Fuzzy Systems, vol. 9, no. 4, pp. 506–515, 2001.
- P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, M. Lewis, W.-t. Yih, T. Rocktäschel et al., “Retrieval-augmented generation for knowledge-intensive nlp tasks,” Advances in Neural Information Processing Systems, vol. 33, pp. 9459–9474, 2020.
- A. N. Melo, C. Salinas, and M. A. Sotelo, “Experimental insights towards explainable and interpretable pedestrian crossing prediction,” 2023.
- C. Han, Q. Zhao, S. Zhang, Y. Chen, Z. Zhang, and J. Yuan, “Yolopv2: Better, faster, stronger for panoptic driving perception,” 2022.
- D. Burgermeister and C. Curio, “Pedrecnet: Multi-task deep neural network for full 3d human pose and orientation estimation,” in 2022 IEEE Intelligent Vehicles Symposium, IV 2022, Aachen, Germany,June 4-9, 2022. IEEE, 2022, pp. 441–448. [Online]. Available: https://doi.org/10.1109/IV51971.2022.9827202
- D. Tran, L. D. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “C3D: generic features for video analysis,” CoRR, vol. abs/1412.0767, 2014. [Online]. Available: http://arxiv.org/abs/1412.0767
- I. Kotseruba, A. Rasouli, and J. K. Tsotsos, “Benchmark for evaluating pedestrian action prediction,” in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), 2021, pp. 1257–1267.
- Q. Xue, Y. Xing, and J. Lu, “An integrated lane change prediction model incorporating traffic context based on trajectory data,” Transportation research part C: emerging technologies, vol. 141, p. 103738, 2022.