Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images (2403.11249v2)

Published 17 Mar 2024 in eess.IV and cs.CV

Abstract: The introduction of YOLOv9, the latest version of the You Only Look Once (YOLO) series, has led to its widespread adoption across various scenarios. This paper is the first to apply the YOLOv9 algorithm model to the fracture detection task as computer-assisted diagnosis (CAD) to help radiologists and surgeons to interpret X-ray images. Specifically, this paper trained the model on the GRAZPEDWRI-DX dataset and extended the training set using data augmentation techniques to improve the model performance. Experimental results demonstrate that compared to the mAP 50-95 of the current state-of-the-art (SOTA) model, the YOLOv9 model increased the value from 42.16% to 43.73%, with an improvement of 3.7%. The implementation code is publicly available at https://github.com/RuiyangJu/YOLOv9-Fracture-Detection.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. \bibinfojournalActa orthopaedica \bibinfovolume89(\bibinfonumber4), \bibinfopages468–473 (\bibinfoyear2018)
  2. \bibinfojournalInvestigative radiology \bibinfovolume55(\bibinfonumber2), \bibinfopages101–110 (\bibinfoyear2020)
  3. \bibinfojournalEuropean journal of radiology \bibinfovolume133, \bibinfopages109373 (\bibinfoyear2020)
  4. \bibinfojournalCanadian Association of Radiologists Journal \bibinfovolume72(\bibinfonumber1), \bibinfopages60–72 (\bibinfoyear2021)
  5. \bibinfojournalActa orthopaedica \bibinfovolume90(\bibinfonumber4), \bibinfopages394–400 (\bibinfoyear2019)
  6. In: \bibinfobooktitleIntelligent Computing: Proceedings of the 2019 Computing Conference, Volume 1, pp.  \bibinfopages971–981. \bibinfoorganizationSpringer (\bibinfoyear2019)
  7. \bibinfojournalEuropean journal of radiology \bibinfovolume126, \bibinfopages108925 (\bibinfoyear2020)
  8. In: \bibinfobooktitleProceedings of the IEEE conference on computer vision and pattern recognition, pp.  \bibinfopages779–788. (\bibinfoyear2016)
  9. \bibinfojournalarXiv preprint arXiv:2004.10934 (\bibinfoyear2020)
  10. \bibinfojournalarXiv preprint arXiv:2107.08430 (\bibinfoyear2021)
  11. \bibinfojournalarXiv preprint arXiv:2105.04206 (\bibinfoyear2021)
  12. \bibinfoauthorGlenn, J.: \bibinfotitleUltralytics yolov5. GitHub. \bibinfohowpublishedhttps://github.com/ultralytics/yolov5 (\bibinfoyear2022)
  13. \bibinfojournalarXiv preprint arXiv:2209.02976 (\bibinfoyear2022)
  14. \bibinfojournalMultimedia Tools and Applications pp.  \bibinfopages1–17. (\bibinfoyear2023)
  15. In: \bibinfobooktitleProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  \bibinfopages7464–7475. (\bibinfoyear2023)
  16. \bibinfoauthorGlenn, J.: \bibinfotitleUltralytics yolov8. GitHub. \bibinfohowpublishedhttps://github.com/ultralytics/ultralytics (\bibinfoyear2023)
  17. In: \bibinfobooktitle2022 37th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC), pp.  \bibinfopages1054–1057. \bibinfoorganizationIEEE (\bibinfoyear2022)
  18. \bibinfojournalMathematics \bibinfovolume10(\bibinfonumber16), \bibinfopages2939 (\bibinfoyear2022)
  19. \bibinfojournalDiagnostics \bibinfovolume13(\bibinfonumber20), \bibinfopages3245 (\bibinfoyear2023)
  20. \bibinfojournalScientific Data \bibinfovolume9(\bibinfonumber1), \bibinfopages222 (\bibinfoyear2022)
  21. \bibinfojournalarXiv preprint arXiv:2304.05071 (\bibinfoyear2023)
  22. In: \bibinfobooktitleICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.  \bibinfopages2235–2239. \bibinfoorganizationIEEE (\bibinfoyear2021)
  23. In: \bibinfobooktitleProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp.  \bibinfopages11534–11542. (\bibinfoyear2020)
  24. \bibinfojournalarXiv preprint arXiv:2112.05561 (\bibinfoyear2021)
  25. In: \bibinfobooktitleProceedings of the European conference on computer vision (ECCV), pp.  \bibinfopages3–19. (\bibinfoyear2018)
  26. \bibinfojournalarXiv preprint arXiv:2402.13616 (\bibinfoyear2024)
  27. In: \bibinfobooktitleComputer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pp.  \bibinfopages740–755. \bibinfoorganizationSpringer (\bibinfoyear2014)
  28. \bibinfojournalLife \bibinfovolume12(\bibinfonumber11), \bibinfopages1711 (\bibinfoyear2022)
  29. In: \bibinfobooktitleMedical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pp.  \bibinfopages234–241. \bibinfoorganizationSpringer (\bibinfoyear2015)
  30. \bibinfojournalDiagnostics \bibinfovolume14(\bibinfonumber1), \bibinfopages11 (\bibinfoyear2023)
  31. \bibinfojournalBiomedical Signal Processing and Control \bibinfovolume93, \bibinfopages106144 (\bibinfoyear2024)
  32. \bibinfojournalInternational Journal of Oral and Maxillofacial Surgery \bibinfovolume51(\bibinfonumber11), \bibinfopages1488–1494 (\bibinfoyear2022)
  33. \bibinfojournalAustralian Journal of Multi-Disciplinary Engineering pp.  \bibinfopages1–9. (\bibinfoyear2024)
  34. \bibinfojournalBiomedical Signal Processing and Control \bibinfovolume91, \bibinfopages105995 (\bibinfoyear2024)
  35. In: \bibinfobooktitleProceedings of the IEEE conference on computer vision and pattern recognition, pp.  \bibinfopages7132–7141. (\bibinfoyear2018)
  36. \bibinfojournalScientific Data \bibinfovolume10(\bibinfonumber1), \bibinfopages521 (\bibinfoyear2023)
  37. \bibinfojournalIEEE Access \bibinfovolume10, \bibinfopages79061–79070 (\bibinfoyear2022)
  38. \bibinfojournalarXiv preprint arXiv:2402.09329 (\bibinfoyear2024)
  39. In: \bibinfobooktitleProceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp.  \bibinfopages390–391. (\bibinfoyear2020)
  40. \bibinfojournalarXiv preprint arXiv:2211.04800 (\bibinfoyear2022)
  41. \bibinfoauthorRuder, S.: \bibinfotitleAn overview of gradient descent optimization algorithms. \bibinfojournalarXiv preprint arXiv:1609.04747 (\bibinfoyear2016)
Citations (22)

Summary

  • The paper demonstrates the use of YOLOv9 for pediatric fracture detection, achieving a mean average precision of 43.73% on the GRAZPEDWRI-DX dataset.
  • It leverages Programmable Gradient Information and GELAN to address information loss in low-feature X-ray images.
  • The study balances detection accuracy with computational efficiency, supporting real-time clinical applications.

YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images: A Detailed Overview

The paper "YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images" presents an innovative application of the YOLOv9 algorithm within the field of computer-assisted diagnosis (CAD) for medical imaging, specifically targeting fracture detection in pediatric wrist trauma. This research leverages the GRAZPEDWRI-DX dataset to offer substantial improvements over previous state-of-the-art methods in terms of detection accuracy, employing advanced deep learning techniques to augment clinical interpretations of X-ray images.

Core Contributions and Findings

The primary contribution of the paper is the application of the YOLOv9 model, a recent iteration in the YOLO series known for its utility in real-time object detection, to a medical imaging context. Key highlights of utilizing the YOLOv9 model include:

  1. Enhanced Performance: The model was trained on the GRAZPEDWRI-DX dataset using data augmentation, which resulted in a notable improvement in performance, achieving a mean average precision (mAP 50-95) of 43.73%. This represents an increase from the existing state-of-the-art (YOLOv8+SA) and demonstrates the model's efficacy in fracture detection tasks.
  2. Addressing Information Loss: The paper tackles the issue of information loss from low-featured X-ray images by utilizing Programmable Gradient Information (PGI) and the Generalized Efficient Layer Aggregation Network (GELAN) within the YOLOv9 architecture. These enhancements facilitate more robust learning and feature extraction from X-ray images, which typically suffer from low informational content.
  3. Prototype Methodology for Real-Time Applications: The YOLOv9 model maintains a balance between accuracy and computational efficiency, crucial for mobile and web-based applications. It ensures that CAD systems for pediatric wrist fracture detection are not only precise but also feasible to deploy in real-time clinical settings.

Methodological Insights

The YOLOv9 algorithm introduces the following innovations that underpin its application for fracture detection:

  • Programmable Gradient Information (PGI): This auxiliary supervision framework enriches gradient propagation, essential for processing the low-feature domains typical of X-ray images, thus enhancing detection accuracy by preserving feature integrity during deeper network layers.
  • Generalized Efficient Layer Aggregation Network (GELAN): By combining CSPNet and ELAN, GELAN contributes to efficient inter-layer information flow, reducing computational demands while improving feature integration. This makes YOLOv9 particularly adaptive for deployment in resource-constrained environments.

Experimental Setup and Results

In their experimentation, the authors conducted model evaluations on the GRAZPEDWRI-DX dataset under varying input image sizes (640 and 1024), demonstrating significant performance gains across both settings. These results validate the model's enhanced feature learning capabilities and its adaptability to different computational resource availabilities, reflecting in improvements over prior methods, specifically in detecting fracture instances.

Practical and Theoretical Implications

This paper’s implications are twofold:

  • Clinical Applicability: The demonstrated enhancement in fracture detection equips clinical practitioners with improved diagnostic tools, potentially aiding in more accurate and timely diagnoses, especially in resource-limited environments where specialized radiologists may not be readily available.
  • Theoretical Advancements: Methodologically, YOLOv9's application suggests avenues for further research into lightweight, efficient models for medical imaging tasks, balancing the trade-off between computational feasibility and accuracy.

Future Directions

The paper also identifies areas for future work, such as enriching training datasets to improve class prediction diversity, particularly for underrepresented classes like "bone anomaly" and "soft tissue." Further, it encourages the exploration of YOLOv9’s application to other medical imaging datasets to generalize its utility across different types of diagnostic tasks beyond pediatric wrist fractures.

In conclusion, this paper presents a significant advance in utilizing deep learning architectures within the field of medical imaging, showing promise for further enhancements in CAD systems tailored to challenging clinical use cases.

X Twitter Logo Streamline Icon: https://streamlinehq.com
Youtube Logo Streamline Icon: https://streamlinehq.com