Data Science for Geographic Information Systems (2404.03754v3)
Abstract: The integration of data science into Geographic Information Systems (GIS) has facilitated the evolution of these tools into complete spatial analysis platforms. The adoption of machine learning and big data techniques has equipped these platforms with the capacity to handle larger amounts of increasingly complex data, transcending the limitations of more traditional approaches. This work traces the historical and technical evolution of data science and GIS as fields of study, highlighting the critical points of convergence between domains, and underlining the many sectors that rely on this integration. A GIS application is presented as a case study in the disaster management sector where we utilize aerial data from Tr\'oia, Portugal, to emphasize the process of insight extraction from raw data. We conclude by outlining prospects for future research in integration of these fields in general, and the developed application in particular.
- [Dataset] Akhund, S. (2022). Analysis of spatial big data for geographical information systems. 10.13140/RG.2.2.20522.70080/2
- A real-time hydrological model for flood prediction using gis and the www. Computers, Environment and Urban Systems 27, 9–32
- Borges, L. R. (1989). Analysis of the wisconsin breast cancer dataset and machine learning for breast cancer detection. Group 1, 15–19
- Principles of geographical information systems (Oxford University Press, USA)
- Machine learning for spatial analyses in urban areas: a scoping review. Sustainable Cities and Society 85, 104050
- Chang, K.-T. (2008). Introduction to geographic information systems, vol. 4 (Mcgraw-hill Boston)
- Machine learning algorithms for urban land use planning: A review. Urban Science 5, 68
- Data warehousing and olap for decision support. In Proceedings of the 1997 ACM SIGMOD international conference on Management of Data. 507–508
- Geospatial data analysis through artificial intelligence: editorial column. GeoJournal , 1–2
- Geographical data science and spatial data analysis: an introduction in R (Sage)
- Dhar, V. (2013). Data science and prediction. Communications of the ACM 56, 64–73
- Diakopoulos, N. (2016). Accountability in algorithmic decision making. Communications of the ACM 59, 56–62
- A rapid review on the use of free and open source technologies and software applied to precision agriculture practices. Journal of Sensor and Actuator Networks 12, 28
- Fachada, N. (2022). A computational pipeline for modeling and predicting wildfire behavior. In Proceedings of the 7th International Conference on Complexity, Future Information Systems and Risk - COMPLEXIS. INSTICC (SciTePress), 79–84. 10.5220/0011073900003197
- From data mining to knowledge discovery in databases. AI magazine 17, 37–37
- Modelos de combustível florestal para portugal - documento de referência, versão de 2021
- Flach, P. (2012). Machine learning: the art and science of algorithms that make sense of data (Cambridge university press)
- Freedman, D. A. (2009). Statistical models: theory and practice (cambridge university press)
- Machine learning methods for earthquake prediction: A survey. In Proceedings of the Fourth Conference on Software Engineering and Information Management (SEIM-2019), Saint Petersburg, Russia. vol. 13, 25
- Gis and image understanding for near-real-time earthquake damage assessment. Photogrammetric engineering and remote sensing 64, 987–994
- Gardner, S. R. (1998). Building the data warehouse. Communications of the ACM 41, 52–60
- Garrard, C. (2016). Geoprocessing with python (Simon and Schuster)
- Goodchild, M. F. (2016). Gis in the era of big data. Cybergeo: European journal of geography
- Deep learning, volume 1
- A review of large area monitoring of land cover change using landsat data. Remote sensing of Environment 122, 66–74
- The rise of big data on urban studies and planning practices in china: Review and open research issues. Journal of Urban Management 4, 92–124
- Distributed frameworks and parallel algorithms for processing large-scale geographic data. Parallel Computing 29, 1297–1333
- Genotype×\times× environment×\times× management interactions of canola across china: A simulation study. Agricultural and Forest Meteorology 247, 424–433
- Remote sensing for ecology and conservation: a handbook of techniques (Oxford University Press)
- Mobile geographic information systems: a case study on mansoura university, egypt. International journal of computer science & information technology 3, 173
- R: A language for data analysis and graphics. journal of computational and graphical statistics 5: 299. doi: 10.2307/1390807
- A review of machine learning applications in wildfire science and management. Environmental Reviews 28, 478–505
- Parallel and distributed gis for processing geo-data: An overview. International Journal of Computer Applications 106, 9–16
- Geographical tracking and mapping of coronavirus disease covid-19/severe acute respiratory syndrome coronavirus 2 (sars-cov-2) epidemic and associated events around the world: how 21st century gis technologies are supporting the global fight against outbreaks and epidemics
- Mobile phone sensing systems: A survey. IEEE Communications Surveys & Tutorials 15, 402–427
- Ki, J. (2018). Gis and big data visualization. In Geographic Information Systems and Science (IntechOpen)
- Kimball, R. (1996). The data warehouse toolkit: practical techniques for building dimensional data warehouses (John Wiley & Sons, Inc.)
- Kitchin, R. (2016). The ethics of smart cities and urban science. Philosophical transactions of the royal society A: Mathematical, physical and engineering sciences 374, 20160115
- Breakthroughs in Statistics: Foundations and basic theory (Springer Science & Business Media)
- Fundamentals of clinical data science
- Geospatial big data: challenges and opportunities. Big Data Research 2, 74–81
- Geospatial big data handling theory and methods: A review and research challenges. ISPRS journal of Photogrammetry and Remote Sensing 115, 119–133
- Tree species classification using hyperion and sentinel-2 data with machine learning in south korea and china. ISPRS International Journal of Geo-Information 8, 150
- Machine learning in disaster management: recent developments in methods and applications. Machine Learning and Knowledge Extraction 4
- Geographic information science and systems (John Wiley & Sons)
- Geospatial analysis of environmental health, vol. 4 (Springer Science & Business Media)
- Big data: A revolution that will transform how we live, work, and think (Houghton Mifflin Harcourt)
- McKinney, W. (2012). Python for data analysis: Data wrangling with Pandas, NumPy, and IPython (” O’Reilly Media, Inc.”)
- Merchant, J. W. (2000). Remote sensing of the environment: an earth resource perspective. Cartography and Geographic Information Science 27, 311–311
- Mitchel, A. et al. (2005). The esri guide to gis analysis, volume 2: Spartial measurements and statistics. ESRI Guide to GIS analysis 2
- Mitchell, A. (1999). The ESRI guide to GIS analysis: geographic patterns & relationships, vol. 1 (ESRI, Inc.)
- Geospatial technologies for crops and soils: An overview. Geospatial technologies for crops and soils , 1–48
- The ethics of algorithms: Mapping the debate. Big Data & Society 3, 2053951716679679
- Enabling cognitive smart cities using big data and machine learning: Approaches and challenges. IEEE Communications Magazine 56, 94–101
- Flood prediction using machine learning models: Literature review. Water 10, 1536
- Neuman, B. C. (1994). Scale in distributed systems. Readings in distributed computing systems
- Multispectral indices for wildfire management
- Peterson, M. P. (2005). Maps and the Internet (Elsevier)
- Satellite remote sensing for applied ecologists: opportunities and challenges. Journal of Applied Ecology 51, 839–848
- National spatial crop yield simulation using gis-based crop production model. Ecological Modelling 136, 113–129
- Spatial modelling of soil-transmitted helminth infections in kenya: a disease control planning tool. PLoS neglected tropical diseases 5, e958
- Geographic data science with python (CRC Press)
- Geographic Information Systems and Science (IntechOpen)
- Data science, predictive analytics, and big data in supply chain management: Current state and future potential. Journal of Business Logistics 36, 120–132
- Global, 30-m resolution continuous fields of tree cover: Landsat-based rescaling of modis vegetation continuous fields with lidar-based estimates of error. International Journal of Digital Earth 6, 427–448
- Thematic cartography and geovisualization (CRC Press)
- Vineyard gap detection by convolutional neural networks fed by multi-spectral images. Algorithms 15, 440
- Real-time collaborative gis: A technological review. ISPRS Journal of Photogrammetry and remote sensing 115, 143–152
- Global irrigated area map (giam), derived from remote sensing, for the end of the last millennium. International journal of remote sensing 30, 3679–3733
- Tomlinson, R. F. (1974). Geographical information systems, spatial data analysis and decision making in government. Ph.D. thesis, University of London
- Big data for transportation and mobility: recent advances, trends and challenges. IET Intelligent Transport Systems 12, 742–755
- Data science in action (Springer)
- Analysis of time-series modis 250 m vegetation index data for crop classification in the us central great plains. Remote sensing of environment 108, 290–310
- Werner, M. (2019). Parallel processing strategies for big geospatial data. Frontiers in big Data 2, 44
- Application of data science technologies in intelligent prediction of traffic congestion. Journal of Advanced Transportation 2019
- Impediments to using gis for real-time disaster decision support. Computers, environment and urban systems 27, 123–141
- Geographical information system parallelization for spatial big data processing: a review. Cluster Computing 19, 139–152