Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios (2407.03237v1)

Published 3 Jul 2024 in cs.CR

Abstract: In recent years, there has been a surge in the development of models for the generation of synthetic mobility data. These models aim to facilitate the sharing of data while safeguarding privacy, all while ensuring high utility and flexibility regarding potential applications. However, current utility evaluation methods fail to fully account for real-life requirements. We evaluate the utility of five state-of-the-art synthesis approaches, each with and without the incorporation of differential privacy (DP) guarantees, in terms of real-world applicability. Specifically, we focus on so-called trip data that encode fine granular urban movements such as GPS-tracked taxi rides. Such data prove particularly valuable for downstream tasks at the road network level. Thus, our initial step involves appropriately map matching the synthetic data and subsequently comparing the resulting trips with those generated by the routing algorithm implemented in OpenStreetMap, which serves as an efficient and privacy-friendly baseline. Out of the five evaluated models, one fails to produce data within reasonable computation time and another generates too many jumps to meet the requirements for map matching. The remaining three models succeed to a certain degree in maintaining spatial distribution, one even with DP guarantees. However, all models struggle to produce meaningful sequences of geo-locations with reasonable trip lengths and to model traffic flow at intersections accurately. It is important to note that trip data encompasses various relevant characteristics beyond spatial distribution, such as temporal information, all of which are discarded by these models. Consequently, our results imply that current synthesis models fall short in their promise of high utility and flexibility.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Deep Learning with Differential Privacy. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (CCS ’16). Association for Computing Machinery, New York, NY, USA, 308–318. https://doi.org/10.1145/2976749.2978318
  2. David Bermbach. 2021. SimRa Rides Berlin 06/19 - 12/20. (Feb. 2021). https://doi.org/10.14279/DEPOSITONCE-10605
  3. David Bermbach and Ahmet-Serdar Karakaya. 2021. SimRa Rides Berlin 01/21 - 09/21. (Oct. 2021). https://doi.org/10.14279/DEPOSITONCE-12452
  4. Generation of Synthetic Trajectory Microdata from Language Models. In Privacy in Statistical Databases (Lecture Notes in Computer Science), Josep Domingo-Ferrer and Maryline Laurent (Eds.). Springer International Publishing, Cham, 172–187. https://doi.org/10.1007/978-3-031-13945-1_13
  5. Where Do Cyclists Ride? A Route Choice Model Developed with Revealed Preference GPS Data. Transportation Research Part A: Policy and Practice 46, 10 (Dec. 2012), 1730–1740. https://doi.org/10.1016/j.tra.2012.07.005
  6. TrajGAIL: Generating Urban Vehicle Trajectories Using Generative Adversarial Imitation Learning. Transportation Research Part C: Emerging Technologies 128 (2021), 103091. https://doi.org/10.1016/j.trc.2021.103091
  7. Cynthia Dwork. 2006. Differential Privacy. In Automata, Languages and Programming (Lecture Notes in Computer Science), Michele Bugliesi, Bart Preneel, Vladimiro Sassone, and Ingo Wegener (Eds.). Springer, Berlin, Heidelberg, 1–12. https://doi.org/10.1007/11787006_1
  8. Cynthia Dwork and Aaron Roth. 2014. The Algorithmic Foundations of Differential Privacy.
  9. Utility-Aware Synthesis of Differentially Private and Attack-Resilient Location Traces. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security. 196–211. https://doi.org/10.1145/3243734.3243741
  10. DPT: Differentially Private Trajectory Synthesis Using Hierarchical Reference Systems. Proceedings of the VLDB Endowment 8, 11 (2015), 1154–1165. https://doi.org/10.14778/2809974.2809978
  11. Alexandra Kapp. 2022. Collection, Usage and Privacy of Mobility Data in the Enterprise and Public Administrations. Proceedings on Privacy Enhancing Technologies 2022, 4 (Oct. 2022), 440–456. https://doi.org/10.56553/popets-2022-0117
  12. Ahmet-Serdar Karakaya and David Bermbach. 2022. SimRa Rides Berlin 10/21 - 09/22. https://doi.org/10.14279/DEPOSITONCE-16439
  13. SimRa: Using Crowdsourcing to Identify near Miss Hotspots in Bicycle Traffic. Pervasive and Mobile Computing 67 (Sept. 2020), 101197. https://doi.org/10.1016/j.pmcj.2020.101197
  14. In Search of Lost Utility: Private Location Data. Proceedings on Privacy Enhancing Technologies 2022, 3 (2022), 354–372. https://doi.org/10.56553/popets-2022-0076
  15. Deriving Features of Traffic Flow around an Intersection from Trajectories of Vehicles. In 2010 18th International Conference on Geoinformatics. 1–5. https://doi.org/10.1109/GEOINFORMATICS.2010.5567483
  16. Understanding Bike Share Cyclist Route Choice Using GPS Data: Comparing Dominant Routes and Shortest Paths. Journal of Transport Geography 71 (July 2018), 172–181. https://doi.org/10.1016/j.jtrangeo.2018.07.012
  17. A Survey on Deep Learning for Human Mobility. ACM Computing Surveys (CSUR) 55, 1 (2021), 1–44.
  18. Dennis Luxen and Christian Vetter. 2011. Real-Time Routing with OpenStreetMap Data. In Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS ’11). ACM, New York, NY, USA, 513–516. https://doi.org/10.1145/2093973.2094062
  19. Carlo Giacomo Prato. 2009. Route Choice Modeling: Past, Present and Future Research Directions. Journal of Choice Modelling 2, 1 (Jan. 2009), 65–100.
  20. Theresa Stadler and Carmela Troncoso. 2022. Why the Search for a Privacy-Preserving Data Sharing Mechanism Is Failing. Nature Computational Science 2, 4 (2022), 208–210. https://doi.org/10.1038/s43588-022-00236-x
  21. Synthesizing Realistic Trajectory Data With Differential Privacy. IEEE Transactions on Intelligent Transportation Systems (2023), 1–14.
  22. PrivTrace: Differentially Private Trajectory Synthesis by Adaptive Markov Model. In USENIX Security Symposium 2023.
  23. A Survey on Traffic Signal Control Methods. (Jan. 2020). arXiv:1904.08117 [cs, stat]
Citations (1)

Summary

We haven't generated a summary for this paper yet.