An Explainable Stacked Ensemble Model for Static Route-Free Estimation of Time of Arrival (2203.09438v2)
Abstract: The duration of a taxi trip, or its Estimated Time of Arrival (ETA), is predicted in order to compute and compare alternative taxi schedules and to give drivers and passengers insight into an upcoming trip. Machine learning models are the state of the art for reaching a high ETA prediction precision. One as yet unexploited option to further increase prediction precision is to combine multiple ETA models into an ensemble. While an increase in prediction precision is likely, the main drawback is that the predictions made by such an ensemble become less transparent because of the sophisticated ensemble architecture. One option to remedy this drawback is to apply eXplainable Artificial Intelligence (XAI). The contribution of this paper is threefold. First, we combine multiple machine learning models for ETA from our previous work into a two-level ensemble model, a stacked ensemble model, which is novel in its own right; with it, we outperform previous state-of-the-art static route-free ETA approaches. Second, we apply existing XAI methods to explain the first- and second-level models of the ensemble. Third, we propose three joining methods for combining the first-level explanations with the second-level ones. These joining methods enable us to explain stacked ensembles for regression tasks. An experimental evaluation shows that the ETA models correctly learned the importance of the input features that drive the prediction.
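To make the idea of joining first-level and second-level explanations concrete, the following is a minimal sketch on synthetic data. It is not one of the three joining methods proposed in the paper, which the abstract does not specify. It assumes purely linear first- and second-level models so that attributions can be computed exactly, and it uses a simple chain-rule style joining in which each first-level attribution is weighted by the second-level coefficient of its model. The feature names, the synthetic data, and all function names are hypothetical.

```python
import numpy as np

def fit_linear(X, y):
    """Least-squares fit with intercept; returns (weights, bias)."""
    A = np.hstack([X, np.ones((X.shape[0], 1))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef[:-1], coef[-1]

# Synthetic, hypothetical trip features: distance (km), hour of day, weekend flag.
rng = np.random.default_rng(0)
n = 500
X = np.column_stack([
    rng.uniform(0.5, 20.0, n),
    rng.integers(0, 24, n),
    rng.integers(0, 2, n),
]).astype(float)
# Synthetic trip duration in seconds (assumption, not real taxi data).
y = 120.0 * X[:, 0] + 5.0 * X[:, 1] - 30.0 * X[:, 2] + rng.normal(0.0, 20.0, n)

# First level: two linear models, each seeing a different feature subset.
subsets = [[0, 1], [0, 2]]
first_level = [fit_linear(X[:, s], y) for s in subsets]

# Second level: a linear model over the first-level predictions.
# (For brevity it is fitted on in-sample predictions; a real stacked
# ensemble would normally use held-out or out-of-fold predictions.)
Z = np.column_stack([X[:, s] @ w + b for s, (w, b) in zip(subsets, first_level)])
v, c = fit_linear(Z, y)

def predict(x):
    """Ensemble prediction for a single feature vector x."""
    z = np.array([x[s] @ w + b for s, (w, b) in zip(subsets, first_level)])
    return z @ v + c

# Explain one trip relative to a baseline input (here: the feature means).
x = X[0]
baseline = X.mean(axis=0)

joined = np.zeros(X.shape[1])
for k, (s, (w, b)) in enumerate(zip(subsets, first_level)):
    phi_k = w * (x[s] - baseline[s])   # first-level attribution per input feature
    joined[s] += v[k] * phi_k          # weight by second-level coefficient

print("joined attributions:", joined)
print("sum of attributions:", joined.sum())
print("prediction - baseline prediction:", predict(x) - predict(baseline))
```

Because every model in this sketch is linear, the joined attributions add up exactly to the difference between the ensemble prediction for the trip and the prediction for the baseline input. With non-linear first- or second-level models, the per-model attributions would come from an XAI method instead, and this additivity would hold only as far as that method guarantees it.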