Single-Shared Network with Prior-Inspired Loss for Parameter-Efficient Multi-Modal Imaging Skin Lesion Classification (2403.19203v1)
Abstract: In this study, we introduce a multi-modal approach that efficiently integrates multi-scale clinical and dermoscopy features within a single network, thereby substantially reducing model parameters. The proposed method includes three novel fusion schemes. Firstly, unlike current methods that usually employ two individual models for for clinical and dermoscopy modalities, we verified that multimodal feature can be learned by sharing the parameters of encoder while leaving the individual modal-specific classifiers. Secondly, the shared cross-attention module can replace the individual one to efficiently interact between two modalities at multiple layers. Thirdly, different from current methods that equally optimize dermoscopy and clinical branches, inspired by prior knowledge that dermoscopy images play a more significant role than clinical images, we propose a novel biased loss. This loss guides the single-shared network to prioritize dermoscopy information over clinical information, implicitly learning a better joint feature representation for the modal-specific task. Extensive experiments on a well-recognized Seven-Point Checklist (SPC) dataset and a collected dataset demonstrate the effectiveness of our method on both CNN and Transformer structures. Furthermore, our method exhibits superiority in both accuracy and model parameters compared to currently advanced methods.
- Multi-scale weight sharing network for image recognition. Pattern Recognition Letters 131, 348–354.
- Final version of 2009 ajcc melanoma staging and classification. Journal of clinical oncology 27, 6199.
- Multi-label classification of multi-modality skin lesion via hyper-connected convolutional neural network. Pattern Recognition 107, 107502.
- Non-melanoma skin cancer diagnosis: a comparison between dermoscopic and smartphone images by unified visual and sonification deep learning algorithms. Journal of cancer research and clinical oncology , 1–9.
- Dermatologist-level classification of skin cancer with deep neural networks. nature 542, 115–118.
- Graph-based intercategory and intermodality network for multilabel classification and melanoma diagnosis of skin lesions in dermoscopy and clinical images. IEEE Transactions on Medical Imaging 41, 3266–3277.
- Multi-view compression and collaboration for skin disease diagnosis. Expert Systems with Applications , 123395.
- Skin disease recognition using deep saliency features and multimodal learning of dermoscopy and clinical images, in: Medical Image Computing and Computer Assisted Intervention- MICCAI 2017: 20th International Conference, Quebec City, QC, Canada, September 11-13, 2017, Proceedings, Part III 20, Springer. pp. 250–258.
- Progressive transfer learning and adversarial domain adaptation for cross-domain skin disease classification. IEEE journal of biomedical and health informatics 24, 1379–1393.
- Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778.
- Co-attention fusion network for multimodal skin cancer diagnosis. Pattern Recognition 133, 108990.
- Densely connected convolutional networks, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708.
- Learning where to learn in cross-view self-supervised learning, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14451–14460.
- Averaging weights leads to wider optima and better generalization. arXiv preprint arXiv:1803.05407 .
- Seven-point checklist and skin lesion classification using multitask multimodal neural nets. IEEE journal of biomedical and health informatics 23, 538–546.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 .
- Anatomy and physiology of the skin. Journal of the Dermatology Nurses’ Association 3, 203–213.
- Swin transformer: Hierarchical vision transformer using shifted windows, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 10012–10022.
- A convnet for the 2020s, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 11976–11986.
- Ci-net: clinical-inspired network for automated skin lesion recognition. IEEE Transactions on Medical Imaging 42, 619–632.
- Pad-ufes-20: A skin lesion dataset composed of patient data and clinical images collected from smartphones. Data in brief 32, 106221.
- A dataset of skin lesion images collected in argentina for the evaluation of ai tools in this population. Scientific Data 10, 712.
- Self-supervised representation learning on neural network weights for model characteristic prediction. Advances in Neural Information Processing Systems 34, 16481–16493.
- Cancer statistics, 2022. CA: a cancer journal for clinicians 72, 7–33.
- A benchmark for automatic visual classification of clinical skin disease images, in: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VI 14, Springer. pp. 206–222.
- Gp-cnn-dtel: Global-part cnn model with data-transformed ensemble learning for skin lesion classification. IEEE journal of biomedical and health informatics 24, 2870–2882.
- Fusionm4net: A multi-stage multi-modal learning algorithm for multi-label skin lesion classification. Medical Image Analysis 76, 102307.
- Exploring the equivalence of siamese self-supervised learning via a unified gradient framework, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14431–14440.
- Comparison of images obtained using four dermoscope imaging devices: An observational study. JEADV Clinical Practice 2, 888–892.
- Adaptive decomposition and shared weight volumetric transformer blocks for efficient patch-free 3d medical image segmentation. IEEE Journal of Biomedical and Health Informatics .
- Adversarial multimodal fusion with attention mechanism for skin lesion classification using clinical and dermoscopic images. Medical Image Analysis 81, 102535.
- Learning deep multimodal feature representation with asymmetric multi-layer fusion, in: Proceedings of the 28th ACM International Conference on Multimedia, pp. 3902–3910.
- Clinical skin lesion diagnosis using representations inspired by dermatologist criteria, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1258–1266.
- Self-paced balance learning for clinical skin disease recognition. IEEE transactions on neural networks and learning systems 31, 2832–2846.
- Single model deep learning on imbalanced small datasets for skin lesion classification. IEEE transactions on medical imaging 41, 1242–1254.
- Multimodal skin lesion classification using deep learning. Experimental dermatology 27, 1261–1267.
- Tformer: A throughout fusion transformer for multi-modal skin lesion diagnosis. Computers in Biology and Medicine 157, 106712.