Self-Supervised Backbone Framework for Diverse Agricultural Vision Tasks (2403.15248v1)
Abstract: Computer vision in agriculture is game-changing with its ability to transform farming into a data-driven, precise, and sustainable industry. Deep learning has empowered agriculture vision to analyze vast, complex visual data, but heavily rely on the availability of large annotated datasets. This remains a bottleneck as manual labeling is error-prone, time-consuming, and expensive. The lack of efficient labeling approaches inspired us to consider self-supervised learning as a paradigm shift, learning meaningful feature representations from raw agricultural image data. In this work, we explore how self-supervised representation learning unlocks the potential applicability to diverse agriculture vision tasks by eliminating the need for large-scale annotated datasets. We propose a lightweight framework utilizing SimCLR, a contrastive learning approach, to pre-train a ResNet-50 backbone on a large, unannotated dataset of real-world agriculture field images. Our experimental analysis and results indicate that the model learns robust features applicable to a broad range of downstream agriculture tasks discussed in the paper. Additionally, the reduced reliance on annotated data makes our approach more cost-effective and accessible, paving the way for broader adoption of computer vision in agriculture.
- The European Space Agency. Sentinel-2: Colour vision for copernicus. https://www.esa.int/Applications/Observing_the_Earth/Copernicus/Sentinel-2, 2015. Accessed: February 27, 2024.
- Deepwheat: Estimating phenotypic traits from crop images with deep learning. In 2018 IEEE Winter conference on applications of computer vision (WACV), pages 323–332. IEEE, 2018.
- Self-supervised augmentation consistency for adapting semantic segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15384–15394, 2021.
- High-resolution uav image generation for sorghum panicle detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1676–1685, 2022.
- Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence, 40(4):834–848, 2017.
- Big self-supervised models are strong semi-supervised learners. In Advances in Neural Information Processing Systems, pages 22243–22255. Curran Associates, Inc., 2020.
- Pseudo-label generation for agricultural robotics applications. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1686–1694, 2022.
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
- Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34:8780–8794, 2021.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- Generative adversarial networks. Communications of the ACM, 63(11):139–144, 2020.
- Self-supervised contrastive learning on agricultural images. Computers and Electronics in Agriculture, 191:106510, 2021.
- Minneapple: A benchmark dataset for apple detection and segmentation. IEEE Robotics and Automation Letters, 5(2):852–858, 2020.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- Mask r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 2961–2969, 2017.
- Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16000–16009, 2022.
- Self-supervised visual feature learning with deep neural networks: A survey. IEEE transactions on pattern analysis and machine intelligence, 43(11):4037–4058, 2020.
- Towards precision agriculture: Iot-enabled intelligent irrigation systems using deep learning neural network. IEEE Sensors Journal, 21(16):17479–17491, 2021.
- Deep learning classification of land cover and crop types using remote sensing data. IEEE Geoscience and Remote Sensing Letters, 14(5):778–782, 2017.
- Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740–755. Springer, 2014.
- Umap: Uniform manifold approximation and projection for dimension reduction, 2020.
- Using deep learning for image-based plant disease detection. Frontiers in plant science, 7:1419, 2016.
- Monitoring agriculture areas with satellite images and deep learning. Applied Soft Computing, 95:106565, 2020.
- Benchmarking self-supervised contrastive learning methods for image-based plant phenotyping. Plant Phenomics, 5:0037, 2023.
- Deepweeds: A multiclass weed species image dataset for deep learning. Scientific reports, 9(1):2058, 2019.
- Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193, 2023.
- On the real-time semantic segmentation of aphid clusters in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6298–6305, 2023.
- Towards computer vision and deep learning facilitated pollination monitoring for agriculture. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2921–2930, 2021.
- Multi-resolution outlier pooling for sorghum classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2931–2939, 2021.
- Mushroom segmentation and 3d pose estimation from point clouds using fully convolutional geometric features and implicit pose encoding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6263–6270, 2023.
- Automation in agriculture by machine and deep learning techniques: A review of recent developments. Precision Agriculture, 22:2053–2091, 2021.
- High throughput phenotyping to accelerate crop breeding and monitoring of diseases in the field. Current opinion in plant biology, 38:184–192, 2017.
- Kihyuk Sohn. Improved deep metric learning with multi-class n-pair loss objective. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2016.
- Conditional image generation with pixelcnn decoders. Advances in neural information processing systems, 29, 2016.
- Understanding the behaviour of contrastive loss. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2495–2504, 2021.
- 3d point cloud instance segmentation of lettuce based on partnet. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1647–1655, 2022.
- Dense contrastive learning for self-supervised visual pre-training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3024–3033, 2021.
- Eca-convnext: A rice leaf disease identification model based on convnext. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pages 6235–6243, 2023.
- Cross-regional oil palm tree detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 56–57, 2020.
- Self-supervised collaborative multi-network for fine-grained visual categorization of tomato diseases. IEEE Access, 8:211912–211923, 2020a.
- A near real-time deep learning approach for detecting rice phenology based on uav images. Agricultural and Forest Meteorology, 287:107938, 2020b.
- Large batch training of convolutional networks, 2017.
- Weed detection in perennial ryegrass with deep learning convolutional neural network. Frontiers in plant science, 10:1422, 2019.
- Cropdeep: The crop vision dataset for deep-learning-based classification and detection in precision agriculture. Sensors, 19(5):1058, 2019.