SepHRNet: Generating High-Resolution Crop Maps from Remote Sensing imagery using HRNet with Separable Convolution (2307.05700v1)
Abstract: The accurate mapping of crop production is crucial for ensuring food security, effective resource management, and sustainable agricultural practices. One way to achieve this is by analyzing high-resolution satellite imagery. Deep Learning has been successful in analyzing images, including remote sensing imagery. However, capturing intricate crop patterns is challenging due to their complexity and variability. In this paper, we propose a novel Deep learning approach that integrates HRNet with Separable Convolutional layers to capture spatial patterns and Self-attention to capture temporal patterns of the data. The HRNet model acts as a backbone and extracts high-resolution features from crop images. Spatially separable convolution in the shallow layers of the HRNet model captures intricate crop patterns more effectively while reducing the computational cost. The multi-head attention mechanism captures long-term temporal dependencies from the encoded vector representation of the images. Finally, a CNN decoder generates a crop map from the aggregated representation. Adaboost is used on top of this to further improve accuracy. The proposed algorithm achieves a high classification accuracy of 97.5\% and IoU of 55.2\% in generating crop maps. We evaluate the performance of our pipeline on the Zuericrop dataset and demonstrate that our results outperform state-of-the-art models such as U-Net++, ResNet50, VGG19, InceptionV3, DenseNet, and EfficientNet. This research showcases the potential of Deep Learning for Earth Observation Systems.
- Spatio-temporal crop mapping using a convolutional neural network and long short-term memory network. Remote Sensing 14, 1 (2022), 56.
- Vincent Dumoulin and Francesco Visin. 2016. A guide to convolution arithmetic for deep learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2243–2251.
- Yoav Freund and Robert E Schapire. 1997. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of computer and system sciences 55, 1 (1997), 119–139.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (1997), 1735–1780.
- A Multi-Scale and Multi-Modal Deep Learning Approach for Spatio-Temporal Crop Mapping Using Sentinel-2 and Landsat-8 Imagery. Remote Sensing 13, 5 (2021), 855. \urldef\tempurl\urlhttps://doi.org/10.3390/rs13050855 \tempurl
- Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4700–4708.
- Crop classification using machine learning algorithms on time series Sentinel-2 data. International Journal of Remote Sensing 41, 11 (2020), 4241–4267.
- Mapping crops within the growing season across the United States. Remote Sensing of Environment 251 (2020), 112048.
- Temporal Consistency and Variability of Optical Indices for Crop Mapping: A Case Study in Southwest China. Remote Sensing 13, 5 (2021), 918. \urldef\tempurl\urlhttps://doi.org/10.3390/rs13050918 \tempurl
- Incorporating Temporal Variability of Crop Reflectance for Crop Classification Using Time Series Sentinel-2 Data. Remote Sensing 12, 12 (2020), 1971. \urldef\tempurl\urlhttps://doi.org/10.3390/rs12121971 \tempurl
- Semantic segmentation of crop type in Africa: A novel dataset and analysis of deep learning methods. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 75–82.
- Improvement in Land Cover and Crop Classification based on Temporal Features Learning from Sentinel-2 Data Using Recurrent-Convolutional Neural Network (R-CNN). Applied Sciences 10, 1 (2020), 238.
- U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234–241.
- Breizhcrops: A time series dataset for crop type mapping. arXiv preprint arXiv:1905.11893 (2019).
- Multi-Temporal Crop Mapping Using U-Net Architecture. Remote Sensing 12, 18 (2020), 2974. \urldef\tempurl\urlhttps://doi.org/10.3390/rs12182974 \tempurl
- Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
- Deep High-Resolution Representation Learning for Visual Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5693–5703.
- Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1–9.
- Mingxing Tan and Quoc V Le. 2019. Efficientnet: Rethinking model scaling for convolutional neural networks. In Proceedings of the IEEE International Conference on Computer Vision. 610–619.
- Crop mapping from image time series: Deep learning with multi-scale label hierarchies. Remote Sensing of Environment 264 (2021), 112603.
- Attention is All You Need. In Proceedings of the 31st Conference on Neural Information Processing Systems. 5998–6008.
- Multi-Scale Feature Fusion for Spatio-Temporal Crop Mapping Using Sentinel-2 Imagery. Remote Sensing 13, 3 (2021), 528. \urldef\tempurl\urlhttps://doi.org/10.3390/rs13030528 \tempurl
- Spatiotemporal Cropland Mapping Using a Combined Features Approach. Remote Sensing 13, 4 (2021), 732. \urldef\tempurl\urlhttps://doi.org/10.3390/rs13040732 \tempurl
- Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4. Springer, 3–11.
- Spatio-Temporal Crop Mapping Using Multi-Scale Convolutional Neural Networks. Remote Sensing 13, 2 (2021), 261. \urldef\tempurl\urlhttps://doi.org/10.3390/rs13020261 \tempurl