Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Predicting Species Occurrence Patterns from Partial Observations (2403.18028v2)

Published 26 Mar 2024 in cs.LG, cs.AI, cs.CV, and q-bio.PE

Abstract: To address the interlinked biodiversity and climate crises, we need an understanding of where species occur and how these patterns are changing. However, observational data on most species remains very limited, and the amount of data available varies greatly between taxonomic groups. We introduce the problem of predicting species occurrence patterns given (a) satellite imagery, and (b) known information on the occurrence of other species. To evaluate algorithms on this task, we introduce SatButterfly, a dataset of satellite images, environmental data and observational data for butterflies, which is designed to pair with the existing SatBird dataset of bird observational data. To address this task, we propose a general model, R-Tran, for predicting species occurrence patterns that enables the use of partial observational data wherever found. We find that R-Tran outperforms other methods in predicting species encounter rates with partial information both within a taxon (birds) and across taxa (birds and butterflies). Our approach opens new perspectives to leveraging insights from species with abundant data to other species with scarce data, by modelling the ecosystems in which they co-occur.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (27)
  1. Census Bureau of USA. https://www.census.gov/geographies/mapping-files/time-series/geo/carto-boundary-file.html, 2018. Accessed: 2023-06-06.
  2. Heterogeneity in habitat and microclimate delay butterfly community tracking of climate change over an elevation gradient. Biological Conservation, 289:110389, 2024.
  3. Integrating machine learning, remote sensing and citizen science to create an early warning system for biodiversity. Plants, people, planet, 5(3):307–316, 2023.
  4. Species distribution modeling for machine learning practitioners: A review. In ACM SIGCAS conference on computing and sustainable societies, pp.  329–348, 2021.
  5. The GeoLifeCLEF 2020 dataset. Preprint arXiv:2004.04192, 2020.
  6. Quantifying relationships between bird and butterfly community shifts and environmental change. Ecological Applications, 16(1):380–393, 2006.
  7. Patterns and causes of covariation in bird and butterfly community structure. Landscape ecology, 30:1461–1472, 2015.
  8. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, KDD’96, pp.  226–231. AAAI Press, 1996.
  9. Deep species distribution modeling from sentinel-2 image time-series: a global scale analysis on the orchid family. Frontiers in Plant Science, 13:839327, 2022.
  10. Butterfly ecology. Annual review of ecology and systematics, 6(1):365–395, 1975.
  11. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  770–778, 2016.
  12. Soilgrids250m: Global gridded soil information based on machine learning. PLoS one, 12(2):e0169748, 2017.
  13. Hijmans and al. Worldclim 1.4 (historical climate conditions). International journal of climatology, 25, 2005.
  14. Responses of butterflies to twentieth century climate warming: implications for future ranges. Proceedings of the Royal Society of London. Series B: Biological Sciences, 269(1505):2163–2171, 2002.
  15. Overview of lifeclef 2022: an evaluation of machine-learning based species identification and species distribution prediction. In International Conference of the Cross-Language Evaluation Forum for European Languages, pp.  257–285. Springer, 2022.
  16. eBird: A human/computer learning network for biodiversity conservation and research. AI Magazine, 34, 03 2013.
  17. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  18. General multi-label image classification with transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  16478–16488, June 2021.
  19. Decoupled weight decay regularization. In International Conference on Learning Representations, 2019. URL https://openreview.net/forum?id=Bkg6RiCqY7.
  20. The partnership of citizen science and machine learning: benefits, risks, and future challenges for engagement, data collection, and data quality. Sustainability, 13(14):8087, 2021.
  21. Ipbes-ipcc co-sponsored workshop report on biodiversity and climate change. IPBES and IPCC, 10, 2021.
  22. ebutterfly: leveraging massive online citizen science for butterfly conservation. Insects, 8(2):53, 2017.
  23. C Sudhakar Reddy. Remote sensing of biodiversity: what to measure and monitor from space to species? Biodiversity and Conservation, 30(10):2617–2631, 2021.
  24. Climate change drives mountain butterflies towards the summits. Scientific Reports, 11(1):14382, 2021.
  25. Satbird: a dataset for bird species distribution modeling using remote sensing and citizen science data. In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2023. URL https://openreview.net/forum?id=Vn5qZGxGj3.
  26. Remote sensing of ecology, biodiversity and conservation: a review from the perspective of remote sensing specialists. Sensors, 10(11):9647–9667, 2010.
  27. Feedback-prop: Convolutional neural network inference under partial evidence. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018.

Summary

We haven't generated a summary for this paper yet.