Transformer with Selective Shuffled Position Embedding and Key-Patch Exchange Strategy for Early Detection of Knee Osteoarthritis (2304.08364v2)
Abstract: Knee OsteoArthritis (KOA) is a widespread musculoskeletal disorder that can severely impact the mobility of older individuals. Insufficient medical data presents a significant obstacle for effectively training models due to the high cost associated with data labelling. Currently, deep learning-based models extensively utilize data augmentation techniques to improve their generalization ability and alleviate overfitting. However, conventional data augmentation techniques are primarily based on the original data and fail to introduce substantial diversity to the dataset. In this paper, we propose a novel approach based on the Vision Transformer (ViT) model with original Selective Shuffled Position Embedding (SSPE) and key-patch exchange strategies to obtain different input sequences as a method of data augmentation for early detection of KOA (KL-0 vs KL-2). More specifically, we fix and shuffle the position embedding of key and non-key patches, respectively. Then, for the target image, we randomly select other candidate images from the training set to exchange their key patches and thus obtain different input sequences. Finally, a hybrid loss function is developed by incorporating multiple loss functions for different types of the sequences. According to the experimental results, the generated data are considered valid as they lead to a notable improvement in the model's classification performance.
- Osteoarthritis: a disease of the joint as an organ. Arthritis and rheumatism, 64(6):1697, 2012.
- Epidemiology and burden of osteoarthritis. British Medical Bulletin, 105(1):185–199, 01 2013.
- Conservative treatment of knee osteoarthritis: A review of the literature. World Journal of Orthopedics, 13(3):212, 2022.
- Total knee replacement as a knee osteoarthritis outcome: predictors derived from a 4-year long-term observation following a randomized clinical trial using chondroitin sulfate. Cartilage, 4(3):219–226, 2013.
- Radiological Assessment of Osteo-Arthrosis. Annals of the Rheumatic Diseases, 16(4):494, 1957.
- Knee x-ray image analysis method for automated detection of osteoarthritis. IEEE Transactions on Biomedical Engineering, 56(2):407–415, 2008.
- Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25:1097–1105, 2012.
- A complex network based approach for knee osteoarthritis detection: Data from the osteoarthritis initiative. Biomedical Signal Processing and Control, 71:103133, 2022.
- Drinet for medical image segmentation. IEEE Transactions on Medical Imaging, 37(11):2453–2462, 2018.
- Discriminative regularized auto-encoder for early detection of knee osteoarthritis: data from the osteoarthritis initiative. IEEE transactions on medical imaging, 39(9):2976–2984, 2020.
- A discriminative shape-texture convolutional neural network for early diagnosis of knee osteoarthritis from x-ray images. Physical and Engineering Sciences in Medicine, pages 1–11, 2023.
- Automatic knee osteoarthritis diagnosis from plain radiographs: A deep learning-based approach. Scientific Reports, 2018.
- Knee osteoarthritis severity grading using vision transformer. Journal of Intelligent & Fuzzy Systems, (Preprint):1–11, 2022.
- Improving classification results on a small medical dataset using a gan; an outlook for dealing with rare disease datasets. Frontiers in Computer Science, page 102, 2022.
- A survey on image data augmentation for deep learning. Journal of big data, 6(1):1–48, 2019.
- Siamese-gap network for early detection of knee osteoarthritis. In 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), pages 1–4, 2022.
- A confident labelling strategy based on deep learning for improving early detection of knee osteoarthritis. arXiv preprint arXiv:2303.13203, 2023.
- Key-exchange convolutional auto-encoder for data augmentation in early knee osteoarthritis classification. arXiv preprint arXiv:2302.13336, 2023.
- Knee joint distraction compared to total knee arthroplasty for treatment of end stage osteoarthritis: Simulating long-term outcomes and cost-effectiveness. PLOS ONE, 11(5):1–13, 05 2016.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- G. Lester. The Osteoarthritis Initiative: A NIH Public–Private Partnership. HSS Journal: The Musculoskeletal Journal of Hospital for Special Surgery, 8(1):62–63, 2011.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- Adapting transformer to end-to-end spoken language translation. In Proceedings of INTERSPEECH 2019, pages 1133–1137. International Speech Communication Association (ISCA), 2019.
- Large scale legal text classification using transformer models. arXiv preprint arXiv:2010.12871, 2020.
- Transformers in vision: A survey. ACM computing surveys (CSUR), 54(10s):1–41, 2022.
- Character-level language modeling with deeper self-attention. In Proceedings of the AAAI conference on artificial intelligence, volume 33, pages 3159–3166, 2019.
- A survey on vision transformer. IEEE transactions on pattern analysis and machine intelligence, 45(1):87–110, 2022.
- Crossvit: Cross-attention multi-scale vision transformer for image classification. In Proceedings of the IEEE/CVF international conference on computer vision, pages 357–366, 2021.
- End-to-end object detection with transformers. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16, pages 213–229. Springer, 2020.
- Missformer: An effective medical image segmentation transformer. arXiv preprint arXiv:2109.07162, 2021.
- When does label smoothing help? Advances in neural information processing systems, 32, 2019.
- Fully automatic knee osteoarthritis severity grading using deep neural networks with a novel ordinal loss. Computerized Medical Imaging and Graphics, 75:84–92, 2019.
- J. Redmon and A. Farhadi. Yolo9000: better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7263–7271, 2017.
- Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321–357, 2002.
- Pytorch: An imperative style, high-performance deep learning library, 2019.
- Grad-cam: Visual explanations from deep networks via gradient-based localization. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 618–626, 2017.