Semantic Trajectory Data Mining with LLM-Informed POI Classification (2405.11715v2)
Abstract: Human travel trajectory mining is crucial for transportation systems, enhancing route optimization, traffic management, and the study of human travel patterns. Previous rule-based approaches without the integration of semantic information show a limitation in both efficiency and accuracy. Semantic information, such as activity types inferred from Points of Interest (POI) data, can significantly enhance the quality of trajectory mining. However, integrating these insights is challenging, as many POIs have incomplete feature information, and current learning-based POI algorithms require the integrity of datasets to do the classification. In this paper, we introduce a novel pipeline for human travel trajectory mining. Our approach first leverages the strong inferential and comprehension capabilities of LLMs to annotate POI with activity types and then uses a Bayesian-based algorithm to infer activity for each stay point in a trajectory. In our evaluation using the OpenStreetMap (OSM) POI dataset, our approach achieves a 93.4% accuracy and a 96.1% F-1 score in POI classification, and a 91.7% accuracy with a 92.3% F-1 score in activity inference.
- X. Wan, M. C. Lucic, H. Ghazzai, and Y. Massoud, “Empowering real-time traffic reporting systems with nlp-processed social media data,” IEEE Open Journal of Intelligent Transportation Systems, vol. 1, pp. 159–175, 2020.
- V. Papathanasopoulou, I. Spyropoulou, H. Perakis, V. Gikas, and E. Andrikopoulou, “A data-driven model for pedestrian behavior classification and trajectory prediction,” IEEE Open Journal of Intelligent Transportation Systems, vol. 3, pp. 328–339, 2022.
- T. M. Bojan, U. R. Kumar, and V. M. Bojan, “An internet of things based intelligent transportation system,” in 2014 IEEE international conference on vehicular electronics and safety. IEEE, 2014, pp. 174–179.
- F. Bouali, J. Pinola, V. Karyotis, B. Wissingh, M. Mitrou, P. Krishnan, and K. Moessner, “5g for vehicular use cases: Analysis of technical requirements, value propositions and outlook,” IEEE Open Journal of Intelligent Transportation Systems, vol. 2, pp. 73–96, 2021.
- A. Kose, H. Lee, C. H. Foh, and M. Dianati, “Beam-based mobility management in 5g millimetre wave v2x communications: A survey and outlook,” IEEE open journal of intelligent transportation systems, vol. 2, pp. 347–363, 2021.
- C. Pasquaretta, T. Dubois, T. Gomez-Moracho, V. P. Delepoulle, G. Le Loc’h, P. Heeb, and M. Lihoreau, “Analysis of temporal patterns in animal movement networks,” Methods in Ecology and Evolution, vol. 12, no. 1, pp. 101–113, 2021.
- Z. Ghandeharioun and A. Kouvelas, “Link travel time estimation for arterial networks based on sparse gps data and considering progressive correlations,” IEEE Open Journal of Intelligent Transportation Systems, vol. 3, pp. 679–694, 2022.
- S. E. Wiehe, A. E. Carroll, G. C. Liu, K. L. Haberkorn, S. C. Hoch, J. S. Wilson, and J. Fortenberry, “Using gps-enabled cell phones to track the travel patterns of adolescents,” International journal of health geographics, vol. 7, pp. 1–11, 2008.
- J. Ugan, M. Abdel-Aty, and Z. Islam, “Using connected vehicle trajectory data to evaluate the effects of speeding,” IEEE Open Journal of Intelligent Transportation Systems, 2023.
- Z. Kan, L. Tang, M.-P. Kwan, C. Ren, D. Liu, and Q. Li, “Traffic congestion analysis at the turn level using taxis’ gps trajectory data,” Computers, Environment and Urban Systems, vol. 74, pp. 229–243, 2019.
- B. Benreguia, H. Moumen, and M. A. Merzoug, “Tracking covid-19 by tracking infectious trajectories,” Ieee Access, vol. 8, pp. 145 242–145 255, 2020.
- K. Siła-Nowicka, J. Vandrol, T. Oshan, J. A. Long, U. Demšar, and A. S. Fotheringham, “Analysis of human mobility patterns from gps trajectories and contextual information,” International Journal of Geographical Information Science, vol. 30, no. 5, pp. 881–906, 2016.
- J. Teusch, J. N. Gremmel, C. Koetsier, F. T. Johora, M. Sester, D. M. Woisetschläger, and J. P. Müller, “A systematic literature review on machine learning in shared mobility,” IEEE Open Journal of Intelligent Transportation Systems, vol. 4, pp. 870–899, 2023.
- M. Ruta, F. Scioscia, S. Ieva, G. Loseto, and E. Di Sciascio, “Semantic annotation of openstreetmap points of interest for mobile discovery and navigation,” in 2012 IEEE First International Conference on Mobile Services. IEEE, 2012, pp. 33–39.
- W. Souffriau and P. Vansteenwegen, “Tourist trip planning functionalities: State–of–the–art and future,” in International Conference on Web Engineering. Springer, 2010, pp. 474–485.
- J. Bao, C. Xu, P. Liu, and W. Wang, “Exploring bikesharing travel patterns and trip purposes using smart card data and online point of interests,” Networks and Spatial Economics, vol. 17, pp. 1231–1253, 2017.
- E. Thonhofer, S. Sigl, M. Fischer, F. Heuer, A. Kuhn, J. Erhart, M. Harrer, and W. Schildorfer, “Infrastructure-based digital twins for cooperative, connected, automated driving and smart road services,” IEEE Open Journal of Intelligent Transportation Systems, 2023.
- M. Haklay and P. Weber, “Openstreetmap: User-generated street maps,” IEEE Pervasive computing, vol. 7, no. 4, pp. 12–18, 2008.
- M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, and B. Scholkopf, “Support vector machines,” IEEE Intelligent Systems and their applications, vol. 13, no. 4, pp. 18–28, 1998.
- S. J. Choi, H. J. Song, S. B. Park, and S. J. Lee, “A poi categorization by composition of onomastic and contextual information,” in 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), vol. 2. IEEE, 2014, pp. 38–45.
- G. Giannopoulos, K. Alexis, N. Kostagiolas, and D. Skoutas, “Classifying points of interest with minimum metadata,” in Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-based Recommendations, Geosocial Networks and Geoadvertising, 2019, pp. 1–4.
- G. Giannopoulos, N. Karagiannakis, D. Skoutas, and S. Athanasiou, “Learning to classify spatiotextual entities in maps,” in The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29–June 2, 2016, Proceedings 13. Springer, 2016, pp. 539–555.
- C. Zhou, H. Yang, J. Zhao, and X. Zhang, “Poi classification method based on feature extension and deep learning,” Journal of Advanced Computational Intelligence and Intelligent Informatics, vol. 24, no. 7, pp. 944–952, 2020.
- V. Milias and A. Psyllidis, “Assessing the influence of point-of-interest features on the classification of place categories,” Computers, Environment and Urban Systems, vol. 86, p. 101597, 2021.
- L. Zheng, W.-L. Chiang, Y. Sheng, S. Zhuang, Z. Wu, Y. Zhuang, Z. Lin, Z. Li, D. Li, E. Xing et al., “Judging llm-as-a-judge with mt-bench and chatbot arena,” Advances in Neural Information Processing Systems, vol. 36, 2024.
- J. Mai, J. Chen, G. Qian, M. Elhoseiny, B. Ghanem et al., “Llm as a robotic brain: Unifying egocentric memory and control,” 2023.
- J. Cui, Z. Li, Y. Yan, B. Chen, and L. Yuan, “Chatlaw: Open-source legal large language model with integrated external knowledge bases,” arXiv preprint arXiv:2306.16092, 2023.
- E. Nijkamp, B. Pang, H. Hayashi, L. Tu, H. Wang, Y. Zhou, S. Savarese, and C. Xiong, “Codegen: An open large language model for code with multi-turn program synthesis,” arXiv preprint arXiv:2203.13474, 2022.
- J. Achiam, S. Adler, S. Agarwal, L. Ahmad, I. Akkaya, F. L. Aleman, D. Almeida, J. Altenschmidt, S. Altman, S. Anadkat et al., “Gpt-4 technical report,” arXiv preprint arXiv:2303.08774, 2023.
- T. Bhattacharya, L. Kulik, and J. Bailey, “Automatically recognizing places of interest from unreliable GPS data using spatio-temporal density estimation and line intersections,” Pervasive and Mobile Computing, vol. 19, pp. 86–107, 2015.
- S. Isaacman et al., “Identifying important places in people’s lives from cellular network data,” in Pervasive Computing, K. Lyons, E. M. Hightower, and J. Hightower, Eds. Springer, 2011, pp. 133–151.
- L. Alexander, S. Jiang, M. Murga, and M. C. González, “Origin–destination trips by purpose and time of day inferred from mobile phone data,” Transportation Research Part C, vol. 58, pp. 240–250, 2015.
- X. Liu, M. Wu, B. Peng, and Q. Huang, “Graph-based representation for identifying individual travel activities with spatiotemporal trajectories and poi data,” Scientific Reports, 2023, published on www.nature.com/scientificreports.
- C. Meng, Y. Cui, Q. He, L. Su, and J. Gao, “Travel purpose inference with gps trajectories, pois, and geo-tagged social media data,” in 2017 IEEE International Conference on Big Data (BIGDATA), 2017.
- S. Jiang, Y. Yang, S. Gupta et al., “The timegeo modeling framework for urban mobility without travel surveys,” Proceedings of the National Academy of Sciences, vol. 113, no. 37, pp. E5370–E5378, 2016.
- M. Gruteser and D. Grunwald, “Anonymous usage of location-based services through spatial and temporal cloaking,” in Proceedings of the 1st International Conference on Mobile Systems, Applications, and Services (MobiSys), 2003, pp. 31–42.
- National Renewable Energy Laboratory, “Transportation Secure Data Center,” Accessed Jan. 15, 2019, 2019. [Online]. Available: https://www.nrel.gov/tsdc
- U.S. Department of Defense, “GPS standard positioning service (SPS) performance standard,” April 2020, 5th Edition.