Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement (2402.19001v3)
Abstract: Accurate classification of laryngeal vascular as benign or malignant is crucial for early detection of laryngeal cancer. However, organizations with limited access to laryngeal vascular images face challenges due to the lack of large and homogeneous public datasets for effective learning. Distinguished from the most familiar works, which directly transfer the ImageNet pre-trained models to the target domain for fine-tuning, this work pioneers exploring two-step heterogeneous transfer learning (THTL) for laryngeal lesion classification with nine deep-learning models, utilizing the diabetic retinopathy color fundus images, semantically non-identical yet vascular images, as the intermediate domain. Attention visualization technique, Layer Class Activate Map (LayerCAM), reveals a novel finding that yet the intermediate and the target domain both reflect vascular structure to a certain extent, the prevalent radial vascular pattern in the intermediate domain prevents learning the features of twisted and tangled vessels that distinguish the malignant class in the target domain, summarizes a vital rule for laryngeal lesion classification using THTL. To address this, we introduce an enhanced fine-tuning strategy in THTL called Step-Wise Fine-Tuning (SWFT) and apply it to the ResNet models. SWFT progressively refines model performance by accumulating fine-tuning layers from back to front, guided by the visualization results of LayerCAM. Comparison with the original THTL approach shows significant improvements. For ResNet18, the accuracy and malignant recall increases by 26.1% and 79.8%, respectively, while for ResNet50, these indicators improve by 20.4% and 62.2%, respectively.
- Double-shot transfer learning for breast cancer classification from x-ray images. Applied Sciences 10, 3999.
- Novel transfer learning approach for medical imaging with limited labeled data. Cancers 13, 1590.
- Optimizing the performance of breast cancer classification by employing the same domain transfer learning from hybrid deep convolutional neural network model. Electronics 9, 445.
- Learned and handcrafted features for early-stage laryngeal scc diagnosis. Medical & Biological Engineering & Computing 57, 2683–2692.
- Classification of brain tumours types based on mri images using mobilenet, in: 2021 2nd International Conference on Innovative and Creative Information Technology (ICITech), pp. 69–73. doi:10.1109/ICITech50181.2021.9590183.
- A new transfer learning based approach to magnification dependent and independent classification of breast cancer in histopathological images. Biomedical Signal Processing and Control 63, 102192.
- Transformer for computer-aided diagnosis of laryngeal carcinoma in pcle images, in: 2021 International Conference on Networking Systems of AI (INSAI), pp. 181–188. doi:10.1109/INSAI54028.2021.00042.
- Narrow-band imaging in oncologic otorhinolaryngology: State of the art. European Annals of Otorhinolaryngology, Head and Neck Diseases 138, 451–458.
- Diagnostic accuracies of laryngeal diseases using a convolutional neural network-based image classification system. The Laryngoscope 131, 2558–2566.
- A survey on heterogeneous transfer learning. Journal of Big Data 4, 1–42.
- Double transfer learning for breast cancer histopathologic image classification, in: 2019 international joint conference on neural networks (IJCNN), IEEE. pp. 1–8.
- Imagenet: A large-scale hierarchical image database, in: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. doi:10.1109/CVPR.2009.5206848.
- Global burden of larynx cancer, 1990-2017: estimates from the global burden of disease 2017 study. Aging (Albany NY) 12, 2545.
- An image is worth 16x16 words: Transformers for image recognition at scale, in: International Conference on Learning Representations. URL: https://openreview.net/forum?id=YicbFdNTTy.
- Diabetic retinopathy detection. URL: https://kaggle.com/competitions/diabetic-retinopathy-detection.
- Contact Endoscopy – Narrow Band Imaging (CE-NBI) Data Set for Laryngeal Lesion Assessment. URL: https://doi.org/10.5281/zenodo.6674034, doi:10.5281/zenodo.6674034.
- Novel automated vessel pattern characterization of larynx contact endoscopic video images. International journal of computer assisted radiology and surgery 14, 1751–1761.
- Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778.
- A deep convolutional neural network-based method for laryngeal squamous cell carcinoma diagnosis. Annals of Translational Medicine 9.
- Joint liver lesion segmentation and classification via transfer learning. arXiv preprint arXiv:2004.12352 .
- Densely connected convolutional networks, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708.
- Layercam: Exploring hierarchical class activation maps for localization. IEEE Transactions on Image Processing 30, 5875–5888. doi:10.1109/TIP.2021.3089943.
- Transfer learning with convolutional neural networks for diabetic retinopathy image classification. a review. Applied Sciences 10, 2021.
- Adam: A method for stochastic optimization. arXiv:1412.6980.
- Narrow band imaging (nbi)—endoscopic method for detection of head and neck cancer. Endoscopy 5, 75–87.
- Hierarchical convolutional features for visual tracking, in: Proceedings of the IEEE International Conference on Computer Vision (ICCV).
- What makes transfer learning work for medical images: Feature reuse & other factors, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9225–9234.
- Tl-med: A two-stage transfer learning recognition model for medical images of covid-19. Biocybernetics and Biomedical Engineering 42, 842–855.
- A scoping review of transfer learning research on medical image analysis using imagenet. Computers in biology and medicine 128, 104115.
- Learning and transferring mid-level image representations using convolutional neural networks, in: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1717–1724. doi:10.1109/CVPR.2014.222.
- A survey on transfer learning. IEEE Transactions on knowledge and data engineering 22, 1345–1359.
- Pytorch: An imperative style, high-performance deep learning library, in: Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., Garnett, R. (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc. URL: https://proceedings.neurips.cc/paper_files/paper/2019/file/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf.
- Transfusion: Understanding transfer learning for medical imaging. Advances in neural information processing systems 32.
- Neural transfer learning for natural language processing. Ph.D. thesis. NUI Galway.
- Mobilenetv2: Inverted residuals and linear bottlenecks, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- Effect of layer-wise fine-tuning in magnification-dependent classification of breast cancer histopathological image. The Visual Computer 36, 1755–1769.
- Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 .
- Going deeper with convolutions, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- Rethinking the inception architecture for computer vision, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- Convolutional neural networks for medical image analysis: Full training or fine tuning? IEEE transactions on medical imaging 35, 1299–1312.
- Distant domain transfer learning, in: Proceedings of the AAAI conference on artificial intelligence.
- Efficientnetv2: Smaller models and faster training, in: Meila, M., Zhang, T. (Eds.), Proceedings of the 38th International Conference on Machine Learning, PMLR. pp. 10096–10106. URL: https://proceedings.mlr.press/v139/tan21a.html.
- Transfer learning with adaptive fine-tuning. IEEE Access 8, 196197–196211. doi:10.1109/ACCESS.2020.3034343.
- Pre-training on grayscale imagenet improves medical image classification, in: Proceedings of the European Conference on Computer Vision (ECCV) Workshops.
- Computer-aided diagnosis of laryngeal cancer via deep learning based on laryngoscopic images. EBioMedicine 48, 92–99.
- Visualizing and understanding convolutional networks, in: Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part I 13, Springer. pp. 818–833.
- Learning deep features for discriminative localization, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2921–2929.