Single-Stage Broad Multi-Instance Multi-Label Learning (BMIML) with Diverse Inter-Correlations and its application to medical image classification (2209.02625v2)
Abstract: In multi-instance multi-label (MIML) learning, an object (e.g., an image) is described by multiple instances (e.g., image patches) and simultaneously associated with multiple labels. Existing MIML methods are useful in many applications, but most suffer from relatively low accuracy and training efficiency due to several issues: i) the inter-label correlations (i.e., the probabilistic correlations between the multiple labels corresponding to an object) are neglected; ii) the inter-instance correlations (i.e., the probabilistic correlations of different instances in predicting the object label) cannot be learned directly (or jointly) with other types of correlations because the instance labels are missing; iii) diverse inter-correlations (e.g., inter-label correlations, inter-instance correlations) can only be learned in multiple stages. To resolve these issues, a new single-stage framework called broad multi-instance multi-label learning (BMIML) is proposed. BMIML has three innovative modules: i) auto-weighted label enhancement learning (AWLEL), based on the broad learning system (BLS), which captures the inter-label correlations simultaneously and efficiently, something traditional BLS cannot do; ii) a dedicated MIML neural network called scalable multi-instance probabilistic regression (SMIPR), which effectively estimates the inter-instance correlations using the object label only and thereby provides additional probabilistic information for learning; iii) an interactive decision optimization (IDO), which combines and optimizes the results from AWLEL and SMIPR to form a single-stage framework. Experiments show that BMIML is highly competitive with (or even better than) existing methods in accuracy and much faster than most MIML methods, even on large medical image data sets (> 90K images).
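For readers unfamiliar with the broad learning system that AWLEL builds on, the following is a minimal NumPy sketch of a plain BLS fitted to multi-label targets. It is an illustration only: it does not implement AWLEL, SMIPR, or IDO, whose details the abstract does not give. The function names `fit_bls` and `predict_bls`, the node counts, the `tanh` activation, and the ridge parameter `reg` are assumptions chosen for the example.

```python
import numpy as np


def fit_bls(X, Y, n_feature=10, n_enhance=20, reg=1e-2, seed=0):
    """Fit a plain broad learning system (hypothetical helper, not BMIML).

    X: (n_samples, n_inputs) object-level features.
    Y: (n_samples, n_labels) multi-label targets in {0, 1}.
    """
    rng = np.random.default_rng(seed)

    # Feature nodes: a fixed random linear map of the input.
    Wf = rng.standard_normal((X.shape[1], n_feature))
    bf = rng.standard_normal(n_feature)
    Z = X @ Wf + bf

    # Enhancement nodes: a fixed random nonlinear map of the feature nodes.
    We = rng.standard_normal((n_feature, n_enhance))
    be = rng.standard_normal(n_enhance)
    H = np.tanh(Z @ We + be)

    # Only the output weights are trained, in closed form by ridge regression:
    # W = (A^T A + reg * I)^{-1} A^T Y, with A = [Z | H].
    A = np.hstack([Z, H])
    W = np.linalg.solve(A.T @ A + reg * np.eye(A.shape[1]), A.T @ Y)
    return {"Wf": Wf, "bf": bf, "We": We, "be": be, "W": W}


def predict_bls(model, X):
    """Return real-valued label scores; threshold them to get label sets."""
    Z = X @ model["Wf"] + model["bf"]
    H = np.tanh(Z @ model["We"] + model["be"])
    return np.hstack([Z, H]) @ model["W"]
```

Because the hidden weights stay fixed and the output weights come from a single linear solve, training needs no backpropagation, which is the usual source of the speed of BLS-style models. Note that this plain form fits each label column independently, so it ignores inter-label correlations, the gap that the abstract says AWLEL is designed to close.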