A pipeline for multiple orange detection and tracking with 3-D fruit relocalization and neural-net based yield regression in commercial citrus orchards (2312.16724v1)
Abstract: Traditionally, sweet orange crop forecasting has involved manually counting fruits from numerous trees, which is a labor-intensive process. Automatic systems for fruit counting, based on proximal imaging, computer vision, and machine learning, have been considered a promising alternative or complement to manual counting. These systems require data association components that prevent multiple counting of the same fruit observed in different images. However, there is a lack of work evaluating the accuracy of multiple fruit counting, especially considering (i) occluded and re-entering green fruits on leafy trees, and (ii) counting ground-truth data measured in the crop field. We propose a non-invasive alternative that utilizes fruit counting from videos, implemented as a pipeline. Firstly, we employ CNNs for the detection of visible fruits. Inter-frame association techniques are then applied to track the fruits across frames. To handle occluded and re-appeared fruit, we introduce a relocalization component that employs 3-D estimation of fruit locations. Finally, a neural network regressor is utilized to estimate the total number of fruit, integrating image-based fruit counting with other tree data such as crop variety and tree size. The results demonstrate that the performance of our approach is closely tied to the quality of the field-collected videos. By ensuring that at least 30% of the fruit is accurately detected, tracked, and counted, our yield regressor achieves an impressive coefficient of determination of 0.85. To the best of our knowledge, this study represents one of the few endeavors in fruit estimation that incorporates manual fruit counting as a reference point for evaluation. We also introduce annotated datasets for multiple orange tracking (MOrangeT) and detection (OranDet), publicly available to foster the development of novel methods for image-based fruit counting.
- A survey: object detection methods from CNN to transformer. Multimedia Tools and Applications, 82:21353–21383, 6 2023. ISSN 15737721. doi:10.1007/S11042-022-13801-3.
- Caracterização fenológica de variedades do tipo murcott em três porta-enxertos. Revista Brasileira de Fruticultura, 27(3):399–403, Dec 2005. ISSN 0100-2945. doi:10.1590/S0100-29452005000300015.
- Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics. EURASIP Journal on Image and Video Processing, 2008:1–10, 2008.
- Simple online and realtime tracking. In 2016 IEEE International Conference on Image Processing (ICIP), pages 3464–3468, 2016. doi:10.1109/ICIP.2016.7533003.
- YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint, 2020. doi:10.48550/arXiv.2004.10934.
- Leo Breiman. Bagging predictors. Machine learning, 24:123–140, 1996.
- Uso de redes neurais convolucionais para detecção de laranjas no campo. In In: Congresso Brasileiro de Agroinformática, 12., 2019, Indaiatuba. Anais, 2019.
- MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155, 2019.
- Support-vector networks. Machine learning, 20(3):273–297, 1995.
- David F. Crouse. On implementing 2d rectangular assignment algorithms. IEEE Transactions on Aerospace and Electronic Systems, 52(4):1679–1696, 2016. doi:10.1109/TAES.2016.140952.
- Andrew J. Davison. FutureMapping: The Computational Structure of Spatial AI Systems. arXiv preprint, 2018. doi:10.48550/ARXIV.1803.11288.
- APPLE MOTS: Detection, Segmentation and Tracking of Homogeneous Objects Using MOTS. IEEE Robotics and Automation Letters, 7:11418–11425, 10 2022. ISSN 23773766. doi:10.1109/LRA.2022.3199026.
- Frank Dellaert. Factor Graphs: Exploiting Structure in Robotics. Annual Review of Control, Robotics, and Autonomous Systems, 4:141–166, 5 2021. ISSN 2573-5144. doi:10.1146/annurev-control-061520-010504.
- Agricultural Robotics: The Future of Robotic Agriculture. arXiv preprint, UK-RAS Network White Papers, 2018. ISSN 2398-4414. doi:10.48550/ARXIV.1806.06762.
- Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 24(6):381–395, jun 1981. ISSN 0001-0782. doi:10.1145/358669.358692.
- Jerome H. Friedman. Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), October 2001.
- Fundecitrus. Relatório de atividades: junho 2021/maio 2022. Technical report, Fundo de Defesa da Citricultura (Fund for Citrus Protection) – Fundecitrus, 2022. URL https://www.fundecitrus.com.br/pdf/relatorios/2022.pdf.
- Fundecitrus. 2022-2023 Orange Crop Forecast for the São Paulo and West-Southwest Minas Gerais Citrus Belt: May Forecast. Technical report, Fundo de Defesa da Citricultura (Fund for Citrus Protection) – Fundecitrus, Araraquara, SP, 2022. URL https://www.fundecitrus.com.br/pdf/pes_relatorios/2022_07_01_Tree_Inventory_and_Orange_Crop_Forecast_2022-2023.pdf.
- Active thermal imaging for immature citrus fruit detection. Biosystems Engineering, 198:291–303, 10 2020. ISSN 1537-5110. doi:10.1016/J.BIOSYSTEMSENG.2020.08.015.
- A novel apple fruit detection and counting methodology based on deep learning and trunk tracking in modern orchard. Computers and Electronics in Agriculture, 197:107000, 6 2022. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107000.
- Fuji-sfm dataset: A collection of annotated images and point clouds for fuji apple detection and location using structure-from-motion photogrammetry. Data in brief, 30:105591, 2020. doi:10.1016/j.dib.2020.105591.
- Aurélien Géron. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O’Reilly Media, Incorporated, 2019. ISBN 9781492032649. URL https://books.google.com.br/books?id=OCS1twEACAAJ.
- Gene H. Golub and Charles F. van Loan. Matrix Computations. JHU Press, fourth edition, 2013. ISBN 1421407949 9781421407944.
- Citrus yield estimation based on images processed by an android mobile phone. Biosystems Engineering, 115:162–170, 6 2013. ISSN 1537-5110. doi:10.1016/J.BIOSYSTEMSENG.2013.03.009.
- Sensors and systems for fruit detection and localization: A review. Computers and Electronics in Agriculture, 116:8–19, 2015. ISSN 0168-1699. doi:10.1016/j.compag.2015.05.021.
- Deep Learning. MIT Press, 2017.
- Minneapple: A benchmark dataset for apple detection and segmentation. IEEE Robotics and Automation Letters, pages 1–1, 1 2020. ISSN 23773766. doi:10.1109/lra.2020.2965061.
- Multiple View Geometry in Computer Vision. Cambridge University Press, 2 edition, 2004. doi:10.1017/CBO9780511811685.
- Fruit yield prediction and estimation in orchards: A state-of-the-art comprehensive review for both direct and indirect methods. Computers and Electronics in Agriculture, 195:106812, 4 2022a. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.106812.
- Cascade-SORT: A robust fruit counting approach using multiple features cascade matching. Computers and Electronics in Agriculture, 200:107223, 9 2022b. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107223.
- Guoquan Huang. Visual-inertial navigation: A concise review. In 2019 International Conference on Robotics and Automation (ICRA), pages 9572–9582, 2019. doi:10.1109/ICRA.2019.8793604.
- A comparative study of fruit detection and counting methods for yield mapping in apple orchards. Journal of Field Robotics, 37(2):263–282, 2020. doi:10.1002/rob.21902.
- Multiple hypothesis tracking revisited. In 2015 IEEE International Conference on Computer Vision (ICCV), pages 4696–4704, 2015. doi:10.1109/ICCV.2015.533.
- Statistics for Biomedical Engineers and Scientists. Academic Press, 1 edition, 2019. doi:10.1016/C2018-0-02241-0.
- Ron Kohavi. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’95), pages 1137–1143, 1995.
- Deep learning for real-time fruit detection and orchard fruit load estimation: Benchmarking of ‘MangoYOLO’. Precision Agriculture, 20:1107–1135, 2019a. doi:10.1007/s11119-019-09642-0.
- Deep learning – method overview and review of use for fruit detection and yield estimation. Computers and Electronics in Agriculture, 162:219–234, 2019b. ISSN 0168-1699. doi:10.1016/j.compag.2019.04.017.
- Attempting to Estimate the Unseen—Correction for Occluded Fruit in Tree Fruit Load Estimation by Machine Vision with Deep Learning. Agronomy 2021, Vol. 11, Page 347, 11:347, 2 2021. ISSN 2073-4395. doi:10.3390/AGRONOMY11020347.
- Harold W. Kuhn. The Hungarian method for the assignment problem. Naval research logistics quarterly, 2(1-2):83–97, 1955.
- Deep learning. Nature, 521(7553):436–444, 2015.
- Yolov6: A single-stage object detection framework for industrial applications, 2022.
- Monocular camera based fruit counting and mapping with semantic data association. IEEE Robotics and Automation Letters, 4:2296–2303, 7 2019. ISSN 2377-3766. doi:10.1109/LRA.2019.2901987.
- A survey of public datasets for computer vision tasks in precision agriculture. Computers and Electronics in Agriculture, 178:105760, 2020. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2020.105760.
- HOTA: A higher order metric for evaluating multi-object tracking. International Journal of Computer Vision, 129:548–578, 2021. doi:10.1007/s11263-020-01375-2.
- Discriminative correlation filter with channel and spatial reliability. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
- Automatic green fruit counting in orange trees using digital images. Computers and Electronics in Agriculture, 127:572–581, 2016. ISSN 0168-1699. doi:10.1016/j.compag.2016.07.023.
- MOT16: A benchmark for multi-object tracking. arXiv preprint, 2016. doi:10.48550/ARXIV.1603.00831.
- Fruit detection and load estimation of an orange orchard using the yolo models through simple approaches in different imaging and illumination conditions. Computers and Electronics in Agriculture, 191:106533, 2021. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2021.106533.
- Scalable nearest neighbor algorithms for high dimensional data. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(11):2227–2240, 2014. doi:10.1109/TPAMI.2014.2321376.
- YOLOv3: An Incremental Improvement. arXiv preprint, 2018. doi:10.48550/ARXIV.1804.02767.
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39:1137–1149, 6 2017. ISSN 01628828. doi:10.1109/TPAMI.2016.2577031.
- ORB: An efficient alternative to SIFT or SURF. In 2011 International Conference on Computer Vision, pages 2564–2571, 2011. doi:10.1109/ICCV.2011.6126544.
- Deepfruits: A fruit detection system using deep neural networks. Sensors, 16:1222, 8 2016. ISSN 1424-8220. doi:10.3390/s16081222.
- eContaFruto OranDet: dataset for image-based orange detection, 2022. URL https://doi.org/10.48432/IG6VIQ.
- eContaFruto MOrangeT: dataset for image-based orange tracking, 2023. URL https://doi.org/10.48432/OI7BFG.
- Structure-from-motion revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
- Object detection with transformers: A review, 2023.
- Efficientdet: Scalable and efficient object detection. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10778–10787, 2020. doi:10.1109/CVPR42600.2020.01079.
- Apple detection during different growth stages in orchards using the improved yolo-v3 model. Computers and Electronics in Agriculture, 157:417–426, 2019. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2019.01.012.
- Apple orchard production estimation using deep learning strategies: A comparison of tracking-by-detection algorithms. Computers and Electronics in Agriculture, 204:107513, 1 2023. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107513.
- Mots: Multi-object tracking and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
- Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, 2022.
- Mango Fruit Load Estimation Using a Video Based MangoYOLO—Kalman Filter—Hungarian Algorithm Method. Sensors 2019, Vol. 19, Page 2742, 19:2742, 6 2019. ISSN 1424-8220. doi:10.3390/S19122742.
- Simple online and realtime tracking with a deep association metric. Proceedings - International Conference on Image Processing, ICIP, 2017-September:3645–3649, 2 2018. ISSN 15224880. doi:10.1109/ICIP.2017.8296962.
- Multilevel systematic sampling to estimate total fruit number for yield forecasts. Precision Agriculture, 13:256–275, 4 2012. ISSN 13852256. doi:10.1007/s11119-011-9245-2.
- Real-time and accurate detection of citrus in complex scenes based on hpl-yolov4. Computers and Electronics in Agriculture, 205:107590, 2023. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2022.107590.
- Pointtrack++ for effective online multi-object tracking and segmentation. arXiv preprint, 2020. doi:10.48550/ARXIV.2007.01549.
- Deep-learning-based in-field citrus fruit detection and tracking. Horticulture Research, 9, 1 2022. ISSN 20527276. doi:10.1093/HR/UHAC003.