Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BenthIQ: a Transformer-Based Benthic Classification Model for Coral Restoration (2311.13661v1)

Published 22 Nov 2023 in cs.CV

Abstract: Coral reefs are vital for marine biodiversity, coastal protection, and supporting human livelihoods globally. However, they are increasingly threatened by mass bleaching events, pollution, and unsustainable practices with the advent of climate change. Monitoring the health of these ecosystems is crucial for effective restoration and management. Current methods for creating benthic composition maps often compromise between spatial coverage and resolution. In this paper, we introduce BenthIQ, a multi-label semantic segmentation network designed for high-precision classification of underwater substrates, including live coral, algae, rock, and sand. Although commonly deployed CNNs are limited in learning long-range semantic information, transformer-based models have recently achieved state-of-the-art performance in vision tasks such as object detection and image classification. We integrate the hierarchical Swin Transformer as the backbone of a U-shaped encoder-decoder architecture for local-global semantic feature learning. Using a real-world case study in French Polynesia, we demonstrate that our approach outperforms traditional CNN and attention-based models on pixel-wise classification of shallow reef imagery.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. Automatic building extraction on satellite images using unet and resnet50. Computational Intelligence and Neuroscience, 2022, 2022.
  2. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, 39(12):2481–2495, 2017.
  3. The value of estuarine and coastal ecosystem services. Ecological monographs, 81(2):169–193, 2011.
  4. End-to-end object detection with transformers. In European conference on computer vision, pp.  213–229. Springer, 2020.
  5. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587, 2017.
  6. Very high resolution mapping of coral reef state using airborne bathymetric lidar surface-intensity and drone imagery. International journal of remote sensing, 39(17):5676–5688, 2018.
  7. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pp.  248–255. Ieee, 2009.
  8. An image is worth 16x16 words: Transformers for image recognition at scale. arxiv 2020. arXiv preprint arXiv:2010.11929, 2010.
  9. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
  10. Comparative evaluation of free web tools imagej and photopea for the surface area quantification of planar substrates and organisms. Diversity, 14(4):272, 2022.
  11. Land cover classification of resources survey remote sensing images based on segmentation model. IEEE Access, 10:56267–56281, 2022.
  12. Global threats to coral reefs: coral bleaching, global climate change, disease, predator plagues and invasive species. Status of coral reefs of the world, 2004:67–92, 2004.
  13. Back-to-back coral bleaching events on isolated atolls in the coral sea. Coral Reefs, 38:713–719, 2019.
  14. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  770–778, 2016.
  15. Swin transformer embedding unet for remote sensing image semantic segmentation. IEEE Transactions on Geoscience and Remote Sensing, 60:1–15, 2022.
  16. Remote sensing of coral reefs for monitoring and management: a review. Remote Sensing, 8(2):118, 2016.
  17. Global warming and recurrent mass bleaching of corals. Nature, 543(7645):373–377, 2017.
  18. Quantifying growth in maricultured corals using photogrammetry. Aquaculture Research, 49(6):2249–2255, 2018.
  19. Transformer-based visual segmentation: A survey. arXiv preprint arXiv:2304.09854, 2023.
  20. Dice loss for data-imbalanced nlp tasks. arXiv preprint arXiv:1911.02855, 2019.
  21. Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  1925–1934, 2017.
  22. Development and application of a video-mosaic survey technology to document the status of coral reef communities. Environmental monitoring and assessment, 125:59–73, 2007.
  23. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp.  10012–10022, 2021.
  24. Deep learning in remote sensing applications: A meta-analysis and review. ISPRS Journal of Photogrammetry and Remote Sensing, 152:166–177, 2019. ISSN 0924-2716. doi: https://doi.org/10.1016/j.isprsjprs.2019.04.015. URL https://www.sciencedirect.com/science/article/pii/S0924271619301108.
  25. DE. McAllister. Environmental, economic and social costs of coral reef destruction in the philippines. Galaxea, 72:161–178, 1988. URL https://eurekamag.com/research/020/972/020972736.php.
  26. Interactions between corals and their symbiotic algae. Coral reefs in the Anthropocene, pp.  99–116, 2015.
  27. Remote sensing of coral reefs and their physical environment. Marine pollution bulletin, 48(3-4):219–228, 2004.
  28. Hybrid multiple attention network for semantic segmentation in aerial images. IEEE Transactions on Geoscience and Remote Sensing, 60:1–18, 2021.
  29. Stuart R Phinn. Coral Reef Remote Sensing-a Guide for Mapping, Monitoring and Management. Springer, 2011.
  30. Optimizing intersection-over-union in deep neural networks for image segmentation. In International symposium on visual computing, pp. 234–244. Springer, 2016.
  31. Size structure of the coral stylophora pistillata across reef flat zones in the central red sea. Scientific Reports, 12(1):13979, 2022.
  32. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pp.  234–241. Springer, 2015.
  33. Semi-automated object-based classification of coral reef habitat using discrete choice models. Remote Sensing, 7(12):15894–15916, 2015.
  34. Attention gated networks: Learning to leverage salient regions in medical images. Medical image analysis, 53:197–207, 2019.
  35. Sen12ms–a curated dataset of georeferenced multi-spectral sentinel-1/2 imagery for deep learning and data fusion. arXiv preprint arXiv:1906.07789, 2019.
  36. SV Smith. Coral-reef area and the contributions of reefs to processes and resources of the world’s oceans. Nature, 273(5659):225–226, 1978.
  37. Spg-net: Segmentation prediction and guidance network for image inpainting. arXiv preprint arXiv:1805.03356, 2018.
  38. Semantic segmentation using vision transformers: A survey. Engineering Applications of Artificial Intelligence, 126:106669, 2023.
  39. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  40. Efficient transformer for remote sensing image segmentation. Remote Sensing, 13(18):3585, 2021.
  41. Mapping the change of coral reefs using remote sensing and in situ measurements: a case study in pangkajene and kepulauan regency, spermonde archipelago, indonesia. Journal of oceanography, 73:623–645, 2017.
  42. Learning a discriminative feature network for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  1857–1866, 2018.
  43. Semantic segmentation with extended deeplabv3 architecture. In 2019 27th Signal Processing and Communications Applications Conference (SIU), pp.  1–4. IEEE, 2019.
  44. Deep learning for semantic segmentation of coral images in underwater photogrammetry. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2:343–350, 2022.
  45. Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp.  2881–2890, 2017.
  46. Combining photogrammetric computer vision and semantic segmentation for fine-grained understanding of coral reef growth under climate change. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp.  186–195, 2023.

Summary

We haven't generated a summary for this paper yet.