GEO-Bench: Toward Foundation Models for Earth Monitoring (2306.03831v2)
Abstract: Recent progress in self-supervision has shown that pre-training large neural networks on vast amounts of unsupervised data can lead to substantial increases in generalization to downstream tasks. Such models, recently coined foundation models, have been transformational to the field of natural language processing. Variants have also been proposed for image data, but their applicability to remote sensing tasks is limited. To stimulate the development of foundation models for Earth monitoring, we propose a benchmark comprised of six classification and six segmentation tasks, which were carefully curated and adapted to be both relevant to the field and well-suited for model evaluation. We accompany this benchmark with a robust methodology for evaluating models and reporting aggregated results to enable a reliable assessment of progress. Finally, we report results for 20 baselines to gain information about the performance of existing models. We believe that this benchmark will be a driver of progress across a variety of Earth monitoring tasks.
- Remote sensing dataset for detecting cows from high resolution aerial images. 2022.
- Deep reinforcement learning at the edge of the statistical precipice. Advances in neural information processing systems, 34:29304–29320, 2021.
- Hamed Alemohammad. The case for open-access ML-ready geospatial training data. In International Geoscience and Remote Sensing Symposium. IEEE, 2021.
- The arcade learning environment: An evaluation platform for general agents. Journal of Artificial Intelligence Research, 47:253–279, 2013.
- On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pages 610–623, 2021.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258, 2021.
- Language models are few-shot learners. arXiv preprint arXiv:2005.14165, 2020.
- Using satellite imagery to understand and promote sustainable development. Science, 371(6535), 2021.
- Emerging properties in self-supervised vision transformers. arXiv preprint arXiv:2104.14294, 2021.
- Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587, 2017.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
- The geolifeclef 2020 dataset. arXiv preprint arXiv:2004.04192, 2020.
- Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery. arXiv preprint arXiv:2207.08051, 2022.
- High-resolution soybean yield mapping across the us midwest using subfield harvester data. Remote Sensing, 12(21):3471, 2020.
- Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
- Automated identification of oil field features using cnns. 2020.
- Current trends in deep learning for earth observation: An open-source benchmark arena for image classification. ISPRS Journal of Photogrammetry and Remote Sensing, 197:18–35, 2023.
- An image is worth 16x16 words: Transformers for image recognition at scale. arxiv 2020. arXiv preprint arXiv:2010.11929, 2010.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Sentinel-2: Esa’s optical high-resolution mission for gmes operational services. Remote sensing of Environment, 120:25–36, 2012.
- B Eforn. Bootstrap methods: another look at the jackknife. The Annals of Statistics, 7:1–26, 1979.
- EPA. Greenhouse Gas Emissions: Understanding Global Warming Potentials. Technical report, US Environmental Protection Agency, February 2017. URL https://www.epa.gov/ghgemissions/understanding-global-warming-potentials.
- ESA. Sentinel-2. Technical report, European Space Agency, Paris, France, 2021. URL https://sentinel.esa.int/web/sentinel/missions/sentinel-2.
- William Falcon and The PyTorch Lightning team. PyTorch Lightning, 3 2019. URL https://github.com/Lightning-AI/lightning.
- Object detection with discriminatively trained part-based models. IEEE transactions on pattern analysis and machine intelligence, 32(9):1627–1645, 2010.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- Eurosat: A novel dataset and deep learning benchmark for land use and land cover classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2019.
- Using self-supervised learning can improve model robustness and uncertainty. arXiv preprint arXiv:1906.12340, 2019.
- Forestnet: Classifying drivers of deforestation in indonesia using deep learning on satellite imagery. arXiv preprint arXiv:2011.05479, 2020.
- Tile2vec: Unsupervised representation learning for spatially distributed data. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 3967–3974, 2019.
- Airborne methane surveys pay for themselves: An economic case study of increased revenue from emissions control. preprint, Environmental Monitoring, July 2021. URL http://eartharxiv.org/repository/view/2532/.
- Methanet - an ai-driven approach to quantifying methane point-source emission from high-resolution 2-d plume imagery. ICML Workshop on Tackling Climate Change with AI, 2021.
- Scaling laws for neural language models. arXiv preprint arXiv:2001.08361, 2020.
- Rapid response crop maps in data sparse regions. arXiv preprint arXiv:2006.16866, 2020.
- Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700, 2019.
- Counting cows: Tracking illegal cattle ranching from high-resolution satellite imagery. arXiv preprint arXiv:2011.07369, 2020.
- Scalable deep learning to identify brick kilns and aid regulatory capacity. Proceedings of the National Academy of Sciences, 118(17), 2021. ISSN 0027-8424. doi: 10.1073/pnas.2018863118. URL https://www.pnas.org/content/118/17/e2018863118.
- Learning to count objects in images. Advances in neural information processing systems, 23, 2010.
- Rsi-cb: A large-scale remote sensing image classification benchmark using crowdsourced data. Sensors, 20(6):1594, 2020.
- Swin transformer v2: Scaling up capacity and resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12009–12019, 2022a.
- A convnet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11976–11986, 2022b.
- Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101, 2017.
- Machine learning-based estimation of forest carbon stocks to increase transparency of forest preservation efforts. 2019 NeurIPS Workshop on Tackling Climate Change with AI (CCAI), 2019.
- Physically-consistent generative adversarial networks for coastal flood visualization. ICML Workshop on AI for Modeling Oceans and Climate Change (AIMOCC), 2021.
- Deep learning in remote sensing applications: A meta-analysis and review. ISPRS journal of photogrammetry and remote sensing, 152:166–177, 2019.
- Seasonal contrast: Unsupervised pre-training from uncurated remote sensing data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 9414–9423, 2021.
- Advancing ai for earth science: A data systems perspective. Eos, 101, 2020.
- 3d-pv-locator: Large-scale detection of rooftop-mounted photovoltaic systems in 3d. Applied Energy, 310:118469, 2022. ISSN 0306-2619. doi: https://doi.org/10.1016/j.apenergy.2021.118469. URL https://www.sciencedirect.com/science/article/pii/S0306261921016937.
- Using artificial intelligence to improve real-time decision-making for high-impact weather. Bulletin of the American Meteorological Society, 98(10), 2017.
- Never-ending learning. Communications of the ACM, 61(5):103–115, April 2018. ISSN 0001-0782, 1557-7317. doi: 10.1145/3191513. URL https://dl.acm.org/doi/10.1145/3191513.
- Chesapeake bay program partnership high-resolution land cover classification accuracy assessment methodology, 2017. URL https://chesapeakeconservancy.org/wp-content/uploads/2017/01/Chesapeake_Conservancy_Accuracy_Assessment_Methodology.pdf.
- Carbon emissions and large neural network training. arXiv preprint arXiv:2104.10350, 2021.
- Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 44–51, 2015.
- High-Pass Filters to Reduce the Effects of Broad Atmospheric Contributions in Sbas Inversions: A Case Study in the Delaware Basin. In IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium, pages 1030–1033, Waikoloa, HI, USA, September 2020. IEEE. ISBN 978-1-72816-374-1. doi: 10.1109/IGARSS39084.2020.9324656. URL https://ieeexplore.ieee.org/document/9324656/.
- Learning transferable visual models from natural language supervision. arXiv preprint arXiv:2103.00020, 2021.
- Large scale high-resolution land cover mapping with multi-resolution data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 12726–12735, 2019.
- Tackling climate change with machine learning. arXiv preprint arXiv:1906.05433, 2019.
- U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention, pages 234–241. Springer, 2015.
- CodeCarbon: Estimate and Track Carbon Emissions from Machine Learning Computing. 2021. doi: 10.5281/zenodo.4658424.
- Green ai. Communications of the ACM, 63(12):54–63, 2020.
- Ognet: Towards a global oil and gas infrastructure database using deep learning on remotely sensed imagery. arXiv preprint arXiv:2011.07227, 2020.
- Torchgeo: deep learning with geospatial data. arXiv preprint arXiv:2111.08872, 2021.
- Energy and policy considerations for deep learning in nlp. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3645–3650, 2019.
- Bigearthnet-mm: A large-scale, multimodal, multilabel benchmark archive for remote sensing image classification and retrieval [software and data sets]. IEEE Geoscience and Remote Sensing Magazine, 9(3):174–180, 2021.
- What makes for good views for contrastive learning? Advances in Neural Information Processing Systems, 33:6827–6839, 2020.
- USGS. Landsat 8. Technical report, United States Geological Survey, Reston, Virginia, USA, 2021. URL https://www.usgs.gov/core-science-systems/nli/landsat/landsat-8?qt-science_support_page_related_con=0#qt-science_support_page_related_con.
- Learning to interpret satellite images using wikipedia. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- An empirical study of remote sensing pretraining. IEEE Transactions on Geoscience and Remote Sensing, pages 1–1, 2022. doi: 10.1109/TGRS.2022.3176603.
- Ssl4eo-s12: A large-scale multi-modal, multi-temporal dataset for self-supervised learning in earth observation.
- A benchmark dataset for canopy crown detection and delineation in co-registered airborne rgb, lidar and hyperspectral imagery from the national ecological observation network. PLoS computational biology, 17(7):e1009180, 2021.
- Earthnets: Empowering ai in earth observation. arXiv preprint arXiv:2210.04936, 2022.
- Sustainbench: Benchmarks for monitoring the sustainable development goals with machine learning. arXiv preprint arXiv:2111.04724, 2021.
- Smallholder cashew plantations in benin, 2021.
- Cumulo: A dataset for learning cloud classes. arXiv preprint arXiv:1911.04227, 2019.
- Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geoscience and Remote Sensing Magazine, 5(4):8–36, 2017.
- So2sat lcz42: A benchmark dataset for global local climate zones classification. arXiv preprint arXiv:1912.12171, 2019.