Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A pipeline for multiple orange detection and tracking with 3-D fruit relocalization and neural-net based yield regression in commercial citrus orchards (2312.16724v1)

Published 27 Dec 2023 in cs.CV

Abstract: Traditionally, sweet orange crop forecasting has involved manually counting fruits from numerous trees, which is a labor-intensive process. Automatic systems for fruit counting, based on proximal imaging, computer vision, and machine learning, have been considered a promising alternative or complement to manual counting. These systems require data association components that prevent multiple counting of the same fruit observed in different images. However, there is a lack of work evaluating the accuracy of multiple fruit counting, especially considering (i) occluded and re-entering green fruits on leafy trees, and (ii) counting ground-truth data measured in the crop field. We propose a non-invasive alternative that utilizes fruit counting from videos, implemented as a pipeline. Firstly, we employ CNNs for the detection of visible fruits. Inter-frame association techniques are then applied to track the fruits across frames. To handle occluded and re-appeared fruit, we introduce a relocalization component that employs 3-D estimation of fruit locations. Finally, a neural network regressor is utilized to estimate the total number of fruit, integrating image-based fruit counting with other tree data such as crop variety and tree size. The results demonstrate that the performance of our approach is closely tied to the quality of the field-collected videos. By ensuring that at least 30% of the fruit is accurately detected, tracked, and counted, our yield regressor achieves an impressive coefficient of determination of 0.85. To the best of our knowledge, this study represents one of the few endeavors in fruit estimation that incorporates manual fruit counting as a reference point for evaluation. We also introduce annotated datasets for multiple orange tracking (MOrangeT) and detection (OranDet), publicly available to foster the development of novel methods for image-based fruit counting.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (68)
  1. A survey: object detection methods from CNN to transformer. Multimedia Tools and Applications, 82:21353–21383, 6 2023. ISSN 15737721. doi:10.1007/S11042-022-13801-3.
  2. Caracterização fenológica de variedades do tipo murcott em três porta-enxertos. Revista Brasileira de Fruticultura, 27(3):399–403, Dec 2005. ISSN 0100-2945. doi:10.1590/S0100-29452005000300015.
  3. Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics. EURASIP Journal on Image and Video Processing, 2008:1–10, 2008.
  4. Simple online and realtime tracking. In 2016 IEEE International Conference on Image Processing (ICIP), pages 3464–3468, 2016. doi:10.1109/ICIP.2016.7533003.
  5. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint, 2020. doi:10.48550/arXiv.2004.10934.
  6. Leo Breiman. Bagging predictors. Machine learning, 24:123–140, 1996.
  7. Uso de redes neurais convolucionais para detecção de laranjas no campo. In In: Congresso Brasileiro de Agroinformática, 12., 2019, Indaiatuba. Anais, 2019.
  8. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155, 2019.
  9. Support-vector networks. Machine learning, 20(3):273–297, 1995.
  10. David F. Crouse. On implementing 2d rectangular assignment algorithms. IEEE Transactions on Aerospace and Electronic Systems, 52(4):1679–1696, 2016. doi:10.1109/TAES.2016.140952.
  11. Andrew J. Davison. FutureMapping: The Computational Structure of Spatial AI Systems. arXiv preprint, 2018. doi:10.48550/ARXIV.1803.11288.
  12. APPLE MOTS: Detection, Segmentation and Tracking of Homogeneous Objects Using MOTS. IEEE Robotics and Automation Letters, 7:11418–11425, 10 2022. ISSN 23773766. doi:10.1109/LRA.2022.3199026.
  13. Frank Dellaert. Factor Graphs: Exploiting Structure in Robotics. Annual Review of Control, Robotics, and Autonomous Systems, 4:141–166, 5 2021. ISSN 2573-5144. doi:10.1146/annurev-control-061520-010504.
  14. Agricultural Robotics: The Future of Robotic Agriculture. arXiv preprint, UK-RAS Network White Papers, 2018. ISSN 2398-4414. doi:10.48550/ARXIV.1806.06762.
  15. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 24(6):381–395, jun 1981. ISSN 0001-0782. doi:10.1145/358669.358692.
  16. Jerome H. Friedman. Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), October 2001.
  17. Fundecitrus. Relatório de atividades: junho 2021/maio 2022. Technical report, Fundo de Defesa da Citricultura (Fund for Citrus Protection) – Fundecitrus, 2022. URL https://www.fundecitrus.com.br/pdf/relatorios/2022.pdf.
  18. Fundecitrus. 2022-2023 Orange Crop Forecast for the São Paulo and West-Southwest Minas Gerais Citrus Belt: May Forecast. Technical report, Fundo de Defesa da Citricultura (Fund for Citrus Protection) – Fundecitrus, Araraquara, SP, 2022. URL https://www.fundecitrus.com.br/pdf/pes_relatorios/2022_07_01_Tree_Inventory_and_Orange_Crop_Forecast_2022-2023.pdf.
  19. Active thermal imaging for immature citrus fruit detection. Biosystems Engineering, 198:291–303, 10 2020. ISSN 1537-5110. doi:10.1016/J.BIOSYSTEMSENG.2020.08.015.
  20. A novel apple fruit detection and counting methodology based on deep learning and trunk tracking in modern orchard. Computers and Electronics in Agriculture, 197:107000, 6 2022. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107000.
  21. Fuji-sfm dataset: A collection of annotated images and point clouds for fuji apple detection and location using structure-from-motion photogrammetry. Data in brief, 30:105591, 2020. doi:10.1016/j.dib.2020.105591.
  22. Aurélien Géron. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O’Reilly Media, Incorporated, 2019. ISBN 9781492032649. URL https://books.google.com.br/books?id=OCS1twEACAAJ.
  23. Gene H. Golub and Charles F. van Loan. Matrix Computations. JHU Press, fourth edition, 2013. ISBN 1421407949 9781421407944.
  24. Citrus yield estimation based on images processed by an android mobile phone. Biosystems Engineering, 115:162–170, 6 2013. ISSN 1537-5110. doi:10.1016/J.BIOSYSTEMSENG.2013.03.009.
  25. Sensors and systems for fruit detection and localization: A review. Computers and Electronics in Agriculture, 116:8–19, 2015. ISSN 0168-1699. doi:10.1016/j.compag.2015.05.021.
  26. Deep Learning. MIT Press, 2017.
  27. Minneapple: A benchmark dataset for apple detection and segmentation. IEEE Robotics and Automation Letters, pages 1–1, 1 2020. ISSN 23773766. doi:10.1109/lra.2020.2965061.
  28. Multiple View Geometry in Computer Vision. Cambridge University Press, 2 edition, 2004. doi:10.1017/CBO9780511811685.
  29. Fruit yield prediction and estimation in orchards: A state-of-the-art comprehensive review for both direct and indirect methods. Computers and Electronics in Agriculture, 195:106812, 4 2022a. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.106812.
  30. Cascade-SORT: A robust fruit counting approach using multiple features cascade matching. Computers and Electronics in Agriculture, 200:107223, 9 2022b. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107223.
  31. Guoquan Huang. Visual-inertial navigation: A concise review. In 2019 International Conference on Robotics and Automation (ICRA), pages 9572–9582, 2019. doi:10.1109/ICRA.2019.8793604.
  32. A comparative study of fruit detection and counting methods for yield mapping in apple orchards. Journal of Field Robotics, 37(2):263–282, 2020. doi:10.1002/rob.21902.
  33. Multiple hypothesis tracking revisited. In 2015 IEEE International Conference on Computer Vision (ICCV), pages 4696–4704, 2015. doi:10.1109/ICCV.2015.533.
  34. Statistics for Biomedical Engineers and Scientists. Academic Press, 1 edition, 2019. doi:10.1016/C2018-0-02241-0.
  35. Ron Kohavi. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI’95), pages 1137–1143, 1995.
  36. Deep learning for real-time fruit detection and orchard fruit load estimation: Benchmarking of ‘MangoYOLO’. Precision Agriculture, 20:1107–1135, 2019a. doi:10.1007/s11119-019-09642-0.
  37. Deep learning – method overview and review of use for fruit detection and yield estimation. Computers and Electronics in Agriculture, 162:219–234, 2019b. ISSN 0168-1699. doi:10.1016/j.compag.2019.04.017.
  38. Attempting to Estimate the Unseen—Correction for Occluded Fruit in Tree Fruit Load Estimation by Machine Vision with Deep Learning. Agronomy 2021, Vol. 11, Page 347, 11:347, 2 2021. ISSN 2073-4395. doi:10.3390/AGRONOMY11020347.
  39. Harold W. Kuhn. The Hungarian method for the assignment problem. Naval research logistics quarterly, 2(1-2):83–97, 1955.
  40. Deep learning. Nature, 521(7553):436–444, 2015.
  41. Yolov6: A single-stage object detection framework for industrial applications, 2022.
  42. Monocular camera based fruit counting and mapping with semantic data association. IEEE Robotics and Automation Letters, 4:2296–2303, 7 2019. ISSN 2377-3766. doi:10.1109/LRA.2019.2901987.
  43. A survey of public datasets for computer vision tasks in precision agriculture. Computers and Electronics in Agriculture, 178:105760, 2020. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2020.105760.
  44. HOTA: A higher order metric for evaluating multi-object tracking. International Journal of Computer Vision, 129:548–578, 2021. doi:10.1007/s11263-020-01375-2.
  45. Discriminative correlation filter with channel and spatial reliability. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
  46. Automatic green fruit counting in orange trees using digital images. Computers and Electronics in Agriculture, 127:572–581, 2016. ISSN 0168-1699. doi:10.1016/j.compag.2016.07.023.
  47. MOT16: A benchmark for multi-object tracking. arXiv preprint, 2016. doi:10.48550/ARXIV.1603.00831.
  48. Fruit detection and load estimation of an orange orchard using the yolo models through simple approaches in different imaging and illumination conditions. Computers and Electronics in Agriculture, 191:106533, 2021. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2021.106533.
  49. Scalable nearest neighbor algorithms for high dimensional data. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(11):2227–2240, 2014. doi:10.1109/TPAMI.2014.2321376.
  50. YOLOv3: An Incremental Improvement. arXiv preprint, 2018. doi:10.48550/ARXIV.1804.02767.
  51. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39:1137–1149, 6 2017. ISSN 01628828. doi:10.1109/TPAMI.2016.2577031.
  52. ORB: An efficient alternative to SIFT or SURF. In 2011 International Conference on Computer Vision, pages 2564–2571, 2011. doi:10.1109/ICCV.2011.6126544.
  53. Deepfruits: A fruit detection system using deep neural networks. Sensors, 16:1222, 8 2016. ISSN 1424-8220. doi:10.3390/s16081222.
  54. eContaFruto OranDet: dataset for image-based orange detection, 2022. URL https://doi.org/10.48432/IG6VIQ.
  55. eContaFruto MOrangeT: dataset for image-based orange tracking, 2023. URL https://doi.org/10.48432/OI7BFG.
  56. Structure-from-motion revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
  57. Object detection with transformers: A review, 2023.
  58. Efficientdet: Scalable and efficient object detection. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10778–10787, 2020. doi:10.1109/CVPR42600.2020.01079.
  59. Apple detection during different growth stages in orchards using the improved yolo-v3 model. Computers and Electronics in Agriculture, 157:417–426, 2019. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2019.01.012.
  60. Apple orchard production estimation using deep learning strategies: A comparison of tracking-by-detection algorithms. Computers and Electronics in Agriculture, 204:107513, 1 2023. ISSN 0168-1699. doi:10.1016/J.COMPAG.2022.107513.
  61. Mots: Multi-object tracking and segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
  62. Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, 2022.
  63. Mango Fruit Load Estimation Using a Video Based MangoYOLO—Kalman Filter—Hungarian Algorithm Method. Sensors 2019, Vol. 19, Page 2742, 19:2742, 6 2019. ISSN 1424-8220. doi:10.3390/S19122742.
  64. Simple online and realtime tracking with a deep association metric. Proceedings - International Conference on Image Processing, ICIP, 2017-September:3645–3649, 2 2018. ISSN 15224880. doi:10.1109/ICIP.2017.8296962.
  65. Multilevel systematic sampling to estimate total fruit number for yield forecasts. Precision Agriculture, 13:256–275, 4 2012. ISSN 13852256. doi:10.1007/s11119-011-9245-2.
  66. Real-time and accurate detection of citrus in complex scenes based on hpl-yolov4. Computers and Electronics in Agriculture, 205:107590, 2023. ISSN 0168-1699. doi:https://doi.org/10.1016/j.compag.2022.107590.
  67. Pointtrack++ for effective online multi-object tracking and segmentation. arXiv preprint, 2020. doi:10.48550/ARXIV.2007.01549.
  68. Deep-learning-based in-field citrus fruit detection and tracking. Horticulture Research, 9, 1 2022. ISSN 20527276. doi:10.1093/HR/UHAC003.

Summary

  • The paper introduces a video-based pipeline that combines CNN detection with 3-D fruit relocalization and neural-net yield regression.
  • It employs tracking algorithms, including the Hungarian algorithm, to accurately detect and track oranges under occlusion and variable lighting.
  • Empirical results show a high prediction accuracy (R²=0.85) when at least 30% of fruits are tracked, marking a notable advance in automated citrus yield estimation.

Introduction to the Problem

The task of yield prediction in citrus orchards involves estimating the number of fruits produced by trees. Traditional methods require labor-intensive manual counting, which can be highly demanding. Automation through imaging, computer vision, and machine learning offers a promising alternative. Despite the advances in automated fruit detection, accurate multiple fruit counting, particularly where fruits are hidden or re-appear in cluttered tree canopies, remains a scientific and technical challenge.

Pipeline Overview

A new approach proposed comprises a pipeline that uses video footage for fruit detection and incorporates tracking algorithms to handle challenges like fruits obscured by leaves or falling outside the camera view. This pipeline includes a step for relocalizing the fruits based on their three-dimensional locations and culminates with a neural network-based yield regressor that estimates the total fruit count, taking into account additional tree data like variety and size.

Fruit Detection and Tracking

The first phase of the pipeline involves using convolutional neural networks (CNNs) to detect oranges from video inputs, which are then tracked across frames. When fruits become occluded, a relocalization mechanism based on 3-D projection comes into play, ensuring fruits are accounted for even if they temporarily disappear from view. The tracking methodology incorporates Hungarian algorithm for frame-to-frame association and adjusts fruit locations by referencing camera motion data. The results show that accurate tracking is achievable even under less-than-ideal conditions such as varying light and motion blur.

Yield Estimation

The final phase uses the output from the tracking phase - the number of detected fruits on individual trees - as input for a neural network regressor. This tool integrates the counts with other attributes, including tree variety, height, and age, to produce a total yield estimate. Empirical results indicate that the regressor can achieve a high coefficient of determination (0.85) when at least 30% of the fruits are correctly detected and tracked. The comprehensive evaluation demonstrates that while perfect fruit tracking eludes, the pipeline can still yield accurate counting results, offering a significant advancement for yield prediction in commercial citrus orchards.

Conclusion and Future Work

This paper presents a robust solution for the challenging task of citrus fruit yield estimation using a video-based detection and tracking pipeline. Future research directions involve refining the pipeline for even better accuracy and adapting it for other agricultural environments. Such advancement represents a major step forward in digital agriculture, promoting efficiency and accuracy in citrus orchard management.

Youtube Logo Streamline Icon: https://streamlinehq.com