Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs (2306.10623v1)

Published 18 Jun 2023 in cs.CV

Abstract: The computer-assisted radiologic informative report has received increasing research attention to facilitate diagnosis and treatment planning for dental care providers. However, manual interpretation of dental images is limited, expensive, and time-consuming. Another barrier in dental imaging is the limited number of available images for training, which is a challenge in the era of deep learning. This study proposes a novel self-distillation (SD) enhanced self-supervised learning on top of the masked image modeling (SimMIM) Transformer, called SD-SimMIM, to improve the outcome with a limited number of dental radiographs. In addition to the prediction loss on masked patches, SD-SimMIM computes the self-distillation loss on the visible patches. We apply SD-SimMIM on dental panoramic X-rays for teeth numbering, detection of dental restorations and orthodontic appliances, and instance segmentation tasks. Our results show that SD-SimMIM outperforms other self-supervised learning methods. Furthermore, we augment and improve the annotation of an existing dataset of panoramic X-rays.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (18)
  1. “Parameters of radiologic care: An official report of the american academy of oral and maxillofacial radiology,” Oral Surgery, Oral Medicine, Oral Pathology, Oral Radiology, and Endodontology, vol. 91, no. 5, pp. 498–511, 2001.
  2. VE Rushton and K Horner, “The use of panoramic radiology in dental practice,” Journal of dentistry, vol. 24, no. 3, pp. 185–201, 1996.
  3. DB Smith, “The numbering of teeth,” New Zealand School Dental Service gazette, vol. 37, no. 4, pp. 56, 1976.
  4. Björn Molander, “Panoramic radiography in dental diagnostics.,” Swedish Dental journal. Supplement, vol. 119, pp. 1–26, 1996.
  5. “Swin transformer: Hierarchical vision transformer using shifted windows,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 10012–10022.
  6. “Simmim: A simple framework for masked image modeling,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 9653–9663.
  7. “Masked autoencoders are scalable vision learners,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 16000–16009.
  8. “Uniform masking: Enabling mae pre-training for pyramid-based vision transformers with locality,” arXiv preprint arXiv:2205.10063, 2022.
  9. “An image is worth 16x16 words: Transformers for image recognition at scale,” ICLR, 2021.
  10. “Self-distillation augmented masked autoencoders for histopathological image classification,” arXiv preprint arXiv:2203.16983, 2022.
  11. “A study on tooth segmentation and numbering using end-to-end deep neural networks,” in 2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI). IEEE, 2020, pp. 164–171.
  12. “Self-supervised learning with masked image modeling for teeth numbering, detection of dental restorations, and instance segmentation in dental panoramic radiographs,” in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023, pp. 5594–5603.
  13. “Distilling the knowledge in a neural network,” NIPS Deep Learning and Representation Learning Workshop, 2014.
  14. “Be your own teacher: Improve the performance of convolutional neural networks via self distillation,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 3713–3722.
  15. Justin Brooks, “Coco annotator,” https://github.com/jsbroks/coco-annotator/, 2019.
  16. “Pytorch: An imperative style, high-performance deep learning library,” Advances in neural information processing systems, vol. 32, 2019.
  17. “Decoupled weight decay regularization,” ICLR, 2019.
  18. “Emerging properties in self-supervised vision transformers,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 9650–9660.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Amani Almalki (3 papers)
  2. Longin Jan Latecki (25 papers)
Citations (3)