Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Privacy risk in GeoData: A survey (2402.03612v2)

Published 6 Feb 2024 in cs.CR

Abstract: With the ubiquitous use of location-based services, large-scale individual-level location data has been widely collected through location-awareness devices. The widespread exposure of such location data poses significant privacy risks to users, as it can lead to re-identification, the inference of sensitive information, and even physical threats. In this survey, we analyse different geomasking techniques proposed to protect individuals' privacy in geodata. We propose a taxonomy to characterise these techniques across various dimensions. We then highlight the shortcomings of current techniques and discuss avenues for future research. Our proposed taxonomy serves as a practical resource for data custodians, offering them a means to navigate the extensive array of existing privacy mechanisms and to identify those that align most effectively with their specific requirements.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (123)
  1. Monsuru Adepeju. 2023. “stppSim”: a novel analytical tool for creating synthetic spatio-temporal point data. Open Journal of Modelling and Simulation 11, 04 (2023), 99–116.
  2. The utility of Zip4 codes in spatial epidemiological analysis. Plos one 18, 5 (2023), e0285552.
  3. Addressing the data guardian and geospatial scientist collaborator dilemma: how to share health records for spatial analysis while maintaining patient confidentiality. International Journal of Health Geographics 18 (2019), 1–12.
  4. Geomasking sensitive health data and privacy protection: an evaluation using an E911 database. Geocarto international 25, 6 (2010), 443–452.
  5. Geo-indistinguishability: Differential privacy for location-based systems. In SIGSAC conference on Computer & communications security. ACM, Berlin, Germany, 901–914.
  6. Luc Anselin. 1995. Local indicators of spatial association—LISA. Geographical analysis 27, 2 (1995), 93–115.
  7. Geographically masking health data to preserve confidentiality. Statistics in medicine 18, 5 (1999), 497–525.
  8. Differentially private clustering in high-dimensional euclidean spaces. In International Conference on Machine Learning. PMLR, Kyoto, Japan, 322–331.
  9. Santosh Kumar Bhoda. 2023. Exploring the Role of Geospatial Technology in Public Health. LinkedIn. https://www.linkedin.com/pulse/exploring-role-geospatial-technology-public-health-bhoda/ Published on LinkedIn.
  10. Measuring the impact of spatial perturbations on the relationship between data privacy and validity of descriptive statistics. International journal of health geographics 20 (2021), 1–16.
  11. An unsupervised classification method for inferring original case locations from low-resolution disease maps. International Journal of Health Geographics 5, 1 (2006), 1–7.
  12. No place to hide—reverse identification of patients from published maps. New England Journal of Medicine 355, 16 (2006), 1741–1742.
  13. A context-sensitive approach to anonymizing spatial surveillance data: impact on outbreak detection. Journal of the American Medical Informatics Association 13, 2 (2006), 160–165.
  14. Re-identification of home addresses from spatial locations anonymized by Gaussian skew. International journal of health geographics 7 (2008), 1–9.
  15. Laure Charleux and Katherine Schofield. 2020. True spatial k-anonymity: Adaptive areal elimination vs. adaptive areal masking. Cartography and Geographic Information Science 47, 6 (2020), 537–549.
  16. Balancing geo-privacy and spatial patterns in epidemiological studies. Geospatial health 12, 2 (2017), 294–299.
  17. An Improved Density Peak Clustering Algorithm Based on Chebyshev Inequality and Differential Privacy. Applied Sciences 13, 15 (2023), 8674.
  18. Keith C Clarke. 2016. A multiscale masking method for point geographic data. International Journal of Geographical Information Science 30, 2 (2016), 300–315.
  19. Differentially-private clustering of easy instances. In International Conference on Machine Learning. PMLR, California, USA, 2049–2059.
  20. Geographic monitoring for early disease detection (GeoMEDD). Scientific Reports 10, 1 (2020), 21753.
  21. Spatial confidentiality and GIS: re-engineering mortality locations from published maps about Hurricane Katrina. International Journal of Health Geographics 5 (2006), 1–12.
  22. Jack Cuzick and Robert Edwards. 1990. Spatial clustering for inhomogeneous populations. Journal of the Royal Statistical Society Series B: Statistical Methodology 52, 1 (1990), 73–96.
  23. Unique in the crowd: The privacy bounds of human mobility. Scientific reports 3, 1 (2013), 1–5.
  24. Philip Dixon. 2001. Ripley’s K function. Encyclopedia of Environmetrics 3, 1 (2001), 1796––1803.
  25. Jörg Drechsler and Jingchen Hu. 2021. Synthesizing geocodes to facilitate access to detailed geographical information in large-scale administrative data. Journal of Survey Statistics and Methodology 9, 3 (2021), 523–548.
  26. Geospatial Data Disclosure Avoidance and the Census. Select Topics in International Censuses (STIC 1, 1 (2022), 1–9.
  27. C. Dwork. 2006. Differential privacy. In International Colloquium on Automata, Languages and Programming. Springer, Venice, Italy, 1–12.
  28. Cynthia Dwork. 2008. Differential privacy: A survey of results. In Theory and Applications of Models of Computation. Springer, Xian, China, 1–19.
  29. Samuel N Eshun and Paolo Palmieri. 2022. Two de-anonymization attacks on real-world location data based on a hidden Markov model. In European Symposium on Security and Privacy Workshops (Euro S&PW). IEEE, Genoa, Italy, 01–09.
  30. Brian FG Fabrègue and Andrea Bogoni. 2023. Privacy and Security Concerns in the Smart City. Smart Cities 6, 1 (2023), 586–613.
  31. TR Fanshawe and PJ Diggle. 2011. Spatial prediction in the presence of positional error. Environmetrics 22, 2 (2011), 109–122.
  32. Fazlay S Faruque. 2022. Correction to: Geospatial Technology for Human Well-Being and Health. In Geospatial Technology for Human Well-Being and Health. Springer, Switzerland, C1–C1.
  33. Claudio Fronterrè et al. 2018. Spatial analysis of geomasked and aggregated data. Ph. D. Dissertation. Università degli studi di Padova.
  34. De-anonymization attack on geolocated data. J. Comput. System Sci. 80, 8 (2014), 1597–1614.
  35. Exploring the effectiveness of geomasking techniques for protecting the geoprivacy of Twitter users. Journal of Spatial Information Science 19, 1 (2019), 105–129.
  36. Differentially private clustering: Tight approximation ratios. Advances in Neural Information Processing Systems 33 (2020), 4040–4054.
  37. Chandan Ghosh. 2023. GIS and Geospatial Studies in disaster management. In International Handbook of Disaster Research. Springer, Singapore, 701–708.
  38. Ruchika Gupta and Udai Pratap Rao. 2020. Preserving location privacy using three layer RDV masking in geocoded published discrete point data. World Wide Web 23, 1 (2020), 175–206.
  39. Mapping health data: improved privacy protection with donut method geomasking. American journal of epidemiology 172, 9 (2010), 1062–1069.
  40. Daniel R Harris. 2020. Leveraging differential privacy in geospatial analyses of standardized healthcare data. In International Conference on Big Data (Big Data). IEEE, Online, 3119–3122.
  41. An effective grouping method for privacy-preserving bike sharing data publishing. Future Internet 9, 4 (2017), 65.
  42. ‘Unmasking’ masked address data: A medoid geocoding solution. MethodsX 10 (2023), 102090.
  43. Collecting Geospatial Data Under Local Differential Privacy With Improving Frequency Estimation. IEEE Transactions on Knowledge and Data Engineering 35, 4 (2022), 6739–6751.
  44. Geographically masking addresses to study COVID-19 clusters. Cartography and Geographic Information Science 49, 5 (2021), 1–15.
  45. Jingchen Hu. 2019. Bayesian estimation of attribute and identification disclosure risks in synthetic data. TRANSACTIONS ON DATA PRIVACY 12, 2 (2019), 61–89.
  46. Jingchen Hu and Jörg Drechsler. 2015. Generating synthetic geocoding information for public release. In NTTS-Conferences on New Techniques and Technologies for Statistics. Eurostat, Brussels, 1–15.
  47. Advancing social and environmental research in cancer registries using geomasking for address-level data. Cancer Epidemiology, Biomarkers & Prevention 32, 11 (2023), 1485–1489.
  48. Differentially private clustering via maximum coverage. In AAAI Conference on Artificial Intelligence. AAAI, Online, 11555–11563.
  49. Reconciling public health common good and individual privacy: new methods and issues in geoprivacy. International Journal of Health Geographics 21, 1 (2022), 1.
  50. Dime Kekana. 2020. A way to use GIS (incl. geomasking) to understand homelessness: a focus on the spatial characteristics of and around sleeping locations of the homeless in Cape Town City Bowl. Master’s thesis. Faculty of Engineering and the Built Environment.
  51. Alexandra Kicior. 2023. Emergency Management, GIS, & Data Privacy. Ethical Geo. https://ethicalgeo.org/emergency-management-gis-data-privacy/
  52. How do people perceive the disclosure risk of maps? Examining the perceived disclosure risk of maps and its implications for geoprivacy protection. Cartography and Geographic Information Science 48, 1 (2021), 2–20.
  53. S. van der Klei. 2019. Role of Geospatial Technologies in Building Smart Cities. Master’s thesis. Utrecht University.
  54. Releasing survey microdata with exact cluster locations and additional privacy safeguards. Humanities and Social Sciences Communications 10, 1 (2023), 1–13.
  55. Accuracy and privacy aspects in free online reverse geocoding services. Cartography and Geographic Information Science 40, 2 (2013), 140–153.
  56. Ourania Kounadi and Michael Leitner. 2015. Spatial information divergence: Using global and local indices to compare geographical masks applied to crime data. Transactions in GIS 19, 5 (2015), 737–757.
  57. Ourania Kounadi and Michael Leitner. 2016. Adaptive areal elimination (AAE): A transparent way of disclosing protected spatial datasets. Computers, Environment and Urban Systems 57 (2016), 59–67.
  58. Martin Kroll. 2015. A graph theoretic linkage attack on microdata in a metric space. Transactions on Data Privacy 8, 3 (2015), 217–243.
  59. M Kulldorff. 2006. Information Management Services Inc: SaTScan v. 7.0: Software for the spatial and space-time scan statistics. Technical Report. Tech rep.
  60. Protection of geoprivacy and accuracy of spatial information: How effective are geographical masks? Cartographica: The International Journal for Geographic Information and Geovisualization 39, 2 (2004), 15–28.
  61. A quadtree approach based on European geographic grids: reconciling data privacy and accuracy. Statistics and Operations Research Transactions 41, 1 (2017), 139–158.
  62. Michael Leitner and Andrew Curtis. 2004. Cartographic guidelines for geographically masking the locations of confidential point data. Cartographic Perspectives 49, 1 (2004), 22–39.
  63. Moritz Leitner and Andrew Curtis. 2006. A first step towards a framework for presenting the location of confidential point data on maps—results of an empirical perceptual study. International Journal of Geographical Information Science 20, 7 (2006), 813–822.
  64. Zhenlong Li and Huan Ning. 2023. Autonomous GIS: the next-generation AI-powered GIS. International Journal of Digital Earth 16, 2 (2023), 4668–4686.
  65. GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding. In Conference on Empirical Methods in Natural Language Processing, Houda Bouamor, Juan Pino, and Kalika Bali (Eds.). Association for Computational Linguistics, Singapore, 5227–5240.
  66. Geomasking through perturbation, or counting points in circles. In 33rd European Workshop on Computational Geometry. EuroCG, Malmö, Sweden, 209–212.
  67. Considering risk locations when defining perturbation zones for geomasking. Cartographica: The International Journal for Geographic Information and Geovisualization 47, 3 (2012), 168–178.
  68. Zhigang Lu and Hong Shen. 2020. Differentially Private k𝑘kitalic_k k-Means Clustering With Convergence Guarantee. IEEE Transactions on Dependable and Secure Computing 18, 4 (2020), 1541–1552.
  69. City Geospatial Dashboard: IoT and Big Data Analytics for Geospatial Solutions Provider in Disaster Management. In International Conference on Information and Communication Technologies for Disaster Management (ICT-DM). IEEE, Paris, France, 1–4. https://doi.org/10.1109/ICT-DM47966.2019.9032921
  70. PrivyTo: A privacy-preserving location-sharing platform. Transactions in GIS 26, 4 (2022), 1703–1717.
  71. Towards Geospatial Foundation Models via Continual Pretraining. In IEEE/CVF International Conference on Computer Vision. IEEE, Paris, France, 16806–16816.
  72. Towards Understanding the Geospatial Skills of ChatGPT: Taking a Geographic Information Systems (GIS) Exam. In Proceedings of the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery. ACM, Germany, 85–94.
  73. Patrick AP Moran. 1950. Notes on continuous stochastic phenomena. Biometrika 37, 1/2 (1950), 17–23.
  74. Protecting patient geo-privacy via a triangular displacement geo-masking method. In 1st sigspatial international workshop on privacy in geographic information collection and analysis. ACM, Texas, USA, 1–9.
  75. Utility-efficient differentially private K-means clustering based on cluster merging. Neurocomputing 424 (2021), 205–214.
  76. Parvaneh Nowbakht. 2022. Geoprivacy protection of agricultural data. The Boolean 6 (2022), 179–183.
  77. A comparison of obfuscation methods used for privacy protection: Exploring the challenges of polygon data in agricultural research. Transactions in GIS 26, 2 (2022), 949–979.
  78. Beata Nowok and Chris Dibben. 2018. Putting synthetic people in place: creating synthetic data for spatial analysis at the individual level. QCumber-EnvHealth project: WP3 Accessible health data 1, 1 (2018), 1–12.
  79. Applications of geospatial and big data technologies in smart farming. In Smart Agriculture for Developing Nations: Status, Perspectives and Challenges. Springer, Singapore, 15–31.
  80. Juan Pablo Duque and Angelly Pugliese. 2023. Geoprivacy Plugin: A set of location privacy tools for geographic data. https://diuke.github.io/GeoPrivPlugin/.
  81. Prem Chandra Pandey and Manish Pandey. 2023. Highlighting the role of agriculture and geospatial technology in food security and sustainable development goals. Sustainable Development 31, 5 (2023), 3175–3195.
  82. Fiona S Polzin. 2020. Adaptive Voronoi Masking. A method to protect confidential discrete spatial data. Master’s thesis. University of Twente,.
  83. Proxi. n.d.. MapsGPT: Building Custom Maps With The Power Of AI. Proxi. https://www.proxi.co/blog/guide-to-mapsgpt
  84. Differentially private grids for geospatial data. In 29th international conference on data engineering (ICDE). IEEE, Brisbane, Australia, 757–768.
  85. Harrison Quick. 2021. Generating Poisson-distributed differentially private synthetic data. Journal of the Royal Statistical Society Series A: Statistics in Society 184, 3 (2021), 1093–1108.
  86. Bayesian marked point process modeling for generating fully synthetic public use data with point-referenced geography. Spatial Statistics 14 (2015), 439–451.
  87. Harrison Quick and Lance A Waller. 2018. Using spatiotemporal models to generate synthetic data for public use. Spatial and Spatio-Temporal Epidemiology 27 (2018), 37–45.
  88. Sriram Raghavan and Christina Shim. 2023. Earth’s climate is changing. IBM’s new geospatial foundation model could help track and adapt to a new landscape. IBM Corporation. https://www.preventionweb.net/news/earths-climate-changing-ibms-new-geospatial-foundation-model-could-help-track-and-adapt-new
  89. Building Privacy-Preserving and Secure Geospatial Artificial Intelligence Foundation Models (Vision Paper). In International Conference on Advances in Geographic Information Systems. ACM, Hamburg, Germany, 1–4.
  90. Sarah Redlich. 2022. Quantitative Analysis of Geomasking Methods. Ph. D. Dissertation. University Duisburg-Essen.
  91. Geospatial Artificial Intelligence (GeoAI): Applications in Health Care. International Journal of Health and Allied Sciences 11, 4 (2022), 4.
  92. Joseph W Sakshaug and Trivellore E Raghunathan. 2014. Generating synthetic microdata to estimate small area statistics in the American Community Survey. STATISTICS 15, 3 (2014), 341––368.
  93. Privacy and false identification risk in geomasking techniques. Geographical Analysis 50, 3 (2018), 280–297.
  94. Spatial obfuscation methods for privacy protection of household-level data. Applied Geography 63 (2015), 253–263.
  95. Chancy Shah. 2023. Revolutionizing Healthcare with Geospatial Intelligence. Technical Report. OSINT TEAM.
  96. Kumar Sharad and George Danezis. 2013. De-anonymizing d4d datasets. In Workshop on hot topics in privacy enhancing technologies. Springer, Indiana, USA, 10–10.
  97. MapSafe: A complete tool for achieving geospatial data sovereignty. Transactions in GIS 27, 6 (2023), 1680–1698.
  98. An overview of data protection strategies for individual-level geocoded data. Technical Report. United Nations Economic Commission For Europe Conference Of European Statisticians.
  99. Dave Stinchcomb. 2004. Procedures for geomasking to protect patient confidentiality. In ESRI international health GIS conference. ESRI, Washington D.C., USA, 17–20.
  100. Ben Strong. 2023. ChatGeoPT: Exploring the future of talking to our maps. Technical Report. Earth Genome.
  101. Differentially private k-means clustering and a hybrid approach to private optimization. ACM Transactions on Privacy and Security (TOPS) 20, 4 (2017), 1–33.
  102. Data Privacy in Disaster Situation: A review. In International Conference on ICT for Smart Society (ICISS). IEEE, Indonesia, 1–4.
  103. MaskMy. XYZ: An easy-to-use tool for protecting geoprivacy using geographic masks. Transactions in GIS 24, 2 (2020), 390–401.
  104. Street masking: a network-based geographic mask for easily protecting geoprivacy. International Journal of Health Geographics 19 (2020), 1–11.
  105. Ran Tao and Jinwen Xu. 2023. Mapping with chatgpt. ISPRS International Journal of Geo-Information 12, 7 (2023), 284.
  106. Exploring geomasking methods for geoprivacy: a pilot study in an environment with built features. Geospatial Health 18, 2 (2023), 1–8.
  107. Jayakrishnan Unnikrishnan and Farid Movahedi Naini. 2013. De-anonymizing private data by matching statistics. In Annual Allerton Conference on Communication, Control, and Computing. IEEE, Illinois, USA, 1616–1623.
  108. A graph matching attack on privacy-preserving record linkage. In International Conference on Information & Knowledge Management. ACM, Virtual, 1485–1494.
  109. An exploratory assessment of the effectiveness of geomasking methods on privacy protection and analytical accuracy for individual-level geospatial data. Cartography and Geographic Information Science 49, 5 (2022), 385–406.
  110. Location privacy-preserving task allocation for mobile crowdsensing with differential geo-obfuscation. In International Conference on World Wide Web. IEEE, Perth, Australia, 627–636.
  111. A de-anonymization attack on geo-located data considering spatio-temporal influences. In International Conference Information and Communications Security. Springer, Beijing, China, 478–484.
  112. Alan F Westin. 1968. Privacy and freedom. Washington and Lee Law Review 25, 1 (1968), 166.
  113. Yonghui Xiao and Li Xiong. 2015. Protecting locations with differential privacy under temporal correlations. In SIGSAC Conference on Computer and Communications Security. ACM, Colorado, USA, 1298–1309.
  114. Achieving Differential Privacy Publishing of Location-Based Statistical Data Using Grid Clustering. ISPRS International Journal of Geo-Information 11, 7 (2022), 404.
  115. K-Means Clustering With Local dx-Privacy for Privacy-Preserving Data Analysis. IEEE Transactions on Information Forensics and Security 17 (2022), 2524–2537.
  116. Robert S Yuill. 1971. The standard deviational ellipse; an updated tool for spatial description. Geografiska Annaler: Series B, Human Geography 53, 1 (1971), 28–39.
  117. Paul A Zandbergen. 2014. Ensuring confidentiality of geocoded health data: assessing geographic masking strategies for individual-level data. Advances in medicine 2014 (2014), 1–14.
  118. The Ethics of AI-Generated Maps: DALL·E 2 and AI’s Implications for Cartography. In 12th International Conference on Geographic Information Science (GIScience 2023). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Dagstuhl, Germany, 93:1–93:6.
  119. The location swapping method for geomasking. Cartography and Geographic Information Science 44, 1 (2017), 22–34.
  120. GeoGPT: Understanding and Processing Geospatial Tasks through An Autonomous GPT. arXiv preprint arXiv:2307.07930 0, 0 (2023), 1–23.
  121. MPDP k-medoids: Multiple partition differential privacy preserving k-medoids clustering for data publishing in the Internet of Medical Things. International Journal of Distributed Sensor Networks 17, 10 (2021), 15501477211042543.
  122. Mayra Zurbaran and Pedro Wightman. 2017. VoKA: Voronoi K-aggregation mechanism for privacy in location-based information systems. In International Carnahan Conference on Security Technology (ICCST). IEEE, Madrid, Spain, 1–6.
  123. NRand-K: Minimizing the impact of location obfuscation in spatial analysis. Transactions in GIS 22, 5 (2018), 1257–1274.
Citations (2)

Summary

We haven't generated a summary for this paper yet.