- The paper demonstrates robust segmentation techniques, with CNNs achieving a pixel-wise fruit segmentation F1-score of 0.791 and outperforming ms-MLP.
- It employs metadata integration and watershed segmentation (F1-score of 0.858) to enhance fruit counting and yield estimation accuracy.
- The research underpins practical applications in precision agriculture, enabling accurate yield predictions and advancing autonomous orchard operations.
# Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards: An Expert Overview
The paper "Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards" by Suchet Bargoti and James P. Underwood addresses the vital problem of efficient yield estimation in precision agriculture through advanced image processing. The research provides a detailed analysis of image segmentation frameworks tailored to identifying and counting fruit in orchard environments, using image data captured by ground vehicles equipped with monocular vision systems.
The authors propose a segmentation framework leveraging modern feature learning methods, specifically evaluating the efficacy of multi-scale Multi-Layered Perceptrons (ms-MLP) and Convolutional Neural Networks (CNNs). Integral to these approaches is the inclusion of metadata—contextual details such as camera positions and environmental conditions—to bolster the classification performance by accounting for intra-class variations present in the orchard's imaging data.
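The metadata integration step can be illustrated with a minimal sketch: per-pixel appearance features are concatenated with image-level metadata before classification, so the classifier can condition its decision on capture context. The feature dimensions and metadata fields below are illustrative placeholders, not the paper's exact configuration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Per-pixel appearance features (toy stand-in for the learned
# features produced by the ms-MLP or CNN in the paper).
n_pixels, n_appearance = 1000, 48
appearance = rng.normal(size=(n_pixels, n_appearance))

# Metadata is constant per image but repeated per pixel; the values
# here (e.g. normalised pixel height, sun angle, row position) are
# hypothetical stand-ins for the paper's contextual fields.
metadata = np.tile([0.4, 0.7, 0.2], (n_pixels, 1))

# Metadata integration: concatenate context onto appearance so a
# downstream classifier can account for intra-class variation.
features = np.concatenate([appearance, metadata], axis=1)
print(features.shape)  # (1000, 51)
```

The key design point is that metadata enters as extra input dimensions rather than as a separate model, which is what lets a simpler classifier such as the ms-MLP benefit from it.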
The experimental validation is conducted in an apple orchard near Melbourne, and the paper reports a pixel-wise fruit segmentation F1-score of 0.791 using CNNs. The CNNs outperformed the ms-MLP in segmentation accuracy; however, the inclusion of metadata, a pivotal component that improved ms-MLP performance, had negligible impact on the CNN results, indicating the inherent capacity of CNNs to capture complex data distributions without auxiliary data.
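For reference, the pixel-wise F1-score reported above is the harmonic mean of precision and recall computed over fruit/non-fruit pixel labels. A small self-contained sketch of the metric:

```python
import numpy as np

def pixel_f1(pred, truth):
    """Pixel-wise F1 for binary fruit/non-fruit masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    tp = np.sum(pred & truth)    # fruit pixels correctly labelled
    fp = np.sum(pred & ~truth)   # background labelled as fruit
    fn = np.sum(~pred & truth)   # fruit pixels missed
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Tiny worked example: 3 true positives, 1 false positive,
# 1 false negative -> precision = recall = 0.75.
truth = np.array([[0, 1, 1, 0],
                  [0, 1, 1, 0]])
pred  = np.array([[0, 1, 0, 0],
                  [0, 1, 1, 1]])
print(round(pixel_f1(pred, truth), 3))  # 0.75
```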
Once the imagery is segmented, fruit detection and counting are performed with the Watershed Segmentation (WS) and Circular Hough Transform (CHT) algorithms; WS proves superior, with an F1-score of 0.858 for apple detection and counting. These detections provide a solid foundation for yield estimation, which achieves a squared correlation coefficient of r² = 0.826 against post-harvest fruit counts. Such results demonstrate the practical applicability of the approach for delivering accurate yield predictions and improving resource management in orchards.
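The counting stage can be approximated with a short sketch: seeds are placed at peaks of the distance transform of the binary fruit mask, the standard seeding idea behind watershed splitting of touching fruit. This is a simplified stand-in using SciPy, not the authors' implementation, and `min_dist` is an illustrative parameter.

```python
import numpy as np
from scipy import ndimage as ndi

def count_fruit(mask, min_dist=3):
    """Count fruit in a binary mask by seeding on peaks of the
    distance transform (the watershed seeding idea)."""
    dist = ndi.distance_transform_edt(mask)
    # A pixel is a seed if it is the maximum of its neighbourhood
    # and lies inside the fruit mask.
    local_max = ndi.maximum_filter(dist, size=2 * min_dist + 1)
    _, n_seeds = ndi.label((dist == local_max) & mask)
    return n_seeds

# Two touching discs form a single connected blob: naive
# component counting would report 1 fruit; distance-transform
# seeding recovers both centres.
yy, xx = np.mgrid[:20, :40]
blob = (((yy - 10) ** 2 + (xx - 12) ** 2) <= 49) | \
       (((yy - 10) ** 2 + (xx - 25) ** 2) <= 49)
print(count_fruit(blob))
```

Splitting clustered fruit is exactly where WS gains its edge over simpler blob counting, since apples frequently overlap in the image plane.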
The implications of this work extend beyond yield estimation, signaling advancements in robotics applications like autonomous fruit picking and tree modeling. The robust methods proposed are poised to bridge the gap between agrovision and state-of-the-art computer vision, a promising trajectory for future in-field robotic operations.
While the methodology marks significant progress, open areas for exploration include refining detection algorithms to handle occlusion, improving robustness to varying illumination without controlled capture conditions, and assessing generalization across cultivars and different fruit types.
In conclusion, this research contributes significantly to precision agriculture by setting a strong precedent for integrating computer vision techniques with ground-based orchard operations. The findings underscore the capabilities of CNNs in agricultural settings and highlight the potential of metadata to enhance simpler models, providing a stepping stone for future innovations aimed at improving agricultural productivity through AI and automation.