Employing Iterative Feature Selection in Fuzzy Rule-Based Binary Classification (2401.16244v1)
Abstract: The feature selection in a traditional binary classification algorithm is always used in the stage of dataset preprocessing, which makes the obtained features not necessarily the best ones for the classification algorithm, thus affecting the classification performance. For a traditional rule-based binary classification algorithm, classification rules are usually deterministic, which results in the fuzzy information contained in the rules being ignored. To do so, this paper employs iterative feature selection in fuzzy rule-based binary classification. The proposed algorithm combines feature selection based on fuzzy correlation family with rule mining based on biclustering. It first conducts biclustering on the dataset after feature selection. Then it conducts feature selection again for the biclusters according to the feedback of biclusters evaluation. In this way, an iterative feature selection framework is build. During the iteration process, it stops until the obtained bicluster meets the requirements. In addition, the rule membership function is introduced to extract vectorized fuzzy rules from the bicluster and construct weak classifiers. The weak classifiers with good classification performance are selected by Adaptive Boosting and the strong classifier is constructed by "weighted average". Finally, we perform the proposed algorithm on different datasets and compare it with other peers. Experimental results show that it achieves good classification performance and outperforms its peers.
- Q. Huang, Y. Huang, Y. Luo, F. Yuan, and X. Li, “Segmentation of breast ultrasound image with semantic classification of superpixels,” Medical Image Analysis, vol. 61, pp. 101657, 2020.
- C. Sowmiya and P. Sumitra, “Analytical study of heart disease diagnosis using classification techniques,” in Proceedings of IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing, pp. 1–5, 2017.
- H. Riri, A. Elmoutaouakkil, A. Beni-Hssane, and F. Bourezgui, “Classification and recognition of dental images using a decisional tree,” in Proceeding of the 13th International Conference on Computer Graphics, Imaging and Visualization, pp. 390–393, 2016.
- S. Zhang, X. Li, M. Zong, X. Zhu, and R. Wang, “Efficient KNN classification with different numbers of nearest neighbors,” IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 5, pp. 1774–1785, 2018.
- C. Schuldt, I. Laptev, and B. Caputo, “Recognizing human actions: A local SVM approach,” in Proceedings of the 17th International Conference on Pattern Recognition, vol. 3, pp. 32–36, 2004.
- S. R. Safavian and D. Landgrebe, “A survey of decision tree classifier methodology,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 21, no. 3, pp. 660–674, 1991.
- J. Yang, Z. Ye, X. Zhang, W. Liu, and H. Jin, “Attribute weighted Naive Bayes for remote sensing image classification based on cuckoo search algorithm,” in Proceedings of 2017 International Conference on Security, Pattern Analysis, and Cybernetics, pp. 169–174, 2017.
- B. Bringmann, S. Nijssen, and A. Zimmermann, “Pattern-based classification: A unifying perspective,” arXiv preprint arXiv:1111.6191, 2011.
- X. Gu, P. Angelov, J. Han, and Q. Shen, “Multilayer Evolving Fuzzy Neural Networks,” IEEE Transactions on Fuzzy Systems, vol. 31, no. 12, pp. 4158–4169, 2023.
- Q. Huang, T. Wang, D. Tao, and X. Li, “Biclustering learning of trading rules,” IEEE Transactions on Cybernetics, vol. 45, no. 10, pp. 2287–2298, 2015.
- T. Lin and H. Lee, “Skin cancer dermoscopy images classification with meta data via deep learning ensemble,” in Proceedings of 2020 International Computer Symposium, pp. 237–241, 2020.
- A. Mahdavi-Shahri, M. Houshmand, M. Yaghoobi, and M. Jalali, “Applying an ensemble learning method for improving multi-label classification performance,” in Proceedings of the 2nd International Conference of Signal Processing and Intelligent Systems, pp. 1–6, 2016.
- T. K. Vyas, “A statistical classification of the benign and malignant neoplasm using ensemble learning and classification algorithms,” in Proceedings of the 12th International Conference on Computing Communication and Networking Technologies, pp. 1–5, 2015.
- S. G. Devi and M. Sabrigiriraj, “Feature selection, online feature selection techniques for big data classification: A review,” in Proceedings of 2018 International Conference on Current Trends towards Converging Technologies, pp. 1–9, 2018.
- F. Tasnim and S. U. Habiba, “A comparative study on heart disease prediction using data mining techniques and feature selection,” in Proceedings of the 2nd International Conference on Robotics, Electrical and Signal Processing Techniques, pp. 338–341, 2021.
- A. Haq, J. Li, M. H. Memon, M. H. Memon, J. Khan, and S. M. Marium, “Heart disease prediction system using model of machine learning and sequential backward selection algorithm for features selection,” in Proceedings of IEEE 5th International Conference for Convergence in Technology, pp. 1–4, 2019.
- T. Yin, H. Chen, Z. Yuan, J. Wan, K. Liu, S. Horng, and T. Li, “A Robust Multilabel Feature Selection Approach Based on Graph Structure Considering Fuzzy Dependency and Feature Interaction,” IEEE Transactions on Fuzzy Systems, vol. 31, no. 12, pp. 4516–4528, 2023.
- K. Li, J. Lu, H. Zuo, and G. Zhang,“Source-Free Multidomain Adaptation With Fuzzy Rule-Based Deep Neural Networks,” in IEEE Transactions on Fuzzy Systems, vol. 31, no. 12, pp. 4180–4194, 2023.
- Y. Zhang, M. Chadli, and Z. Xiang,“Prescribed-Time Formation Control for a Class of Multiagent Systems via Fuzzy Reinforcement Learning,” in IEEE Transactions on Fuzzy Systems, vol. 31, no. 12, pp. 4195–4204, 2023.
- C. Wang, W. Pedrycz, Z. Li, M. Zhou, and S. Sam Ge, “G-Image segmentation: Similarity-preserving Fuzzy C-Means with spatial information constraint in wavelet space,” IEEE Transactions on Fuzzy Systems, vol. 29, no. 12, pp. 3887–3898, 2021.
- C. Wang, W. Pedrycz, Z. Li, and M. Zhou, “Residual-driven Fuzzy C-Means clustering for image segmentation,” IEEE/CAA Journal of Automatica Sinica, vol. 8, no. 4, pp. 876–889, 2021.
- C. Wang, Z. Yan, W. Pedrycz, M. Zhou, and Z. Li, “A weighted fidelity and regularization-based method for mixed or unknown noise removal from images on graphs,” IEEE Transactions on Image Processing, vol. 29, pp. 5229–5243, 2020.
- C. Wang, W. Pedrycz, J. Yang, M. Zhou, and Z. Li, “Wavelet frame-based Fuzzy C-Means clustering for segmenting images on graphs,” IEEE Transactions on Cybernetics, vol. 50, no. 9, pp. 3938–3949, 2020.
- S. Das and U. R. Jena, “Texture classification using combination of LBP and GLRLM features along with KNN and multiclass SVM classification,” in Proceedings of the 2nd International Conference on Communication Control and Intelligent Systems, pp. 115–119, 2016.
- M. Yang, S. Cui, Y. Zhang, J. Zhang, and X. Li, “Data and image classification of haematococcus pluvialis based on SVM Algorithm,” in Proceedings of 2021 China Automation Congress, pp. 522–525, 2021.
- S. Sardari and M. Eftekhari, “A fuzzy decision tree approach for imbalanced data classification,” in Proceedings of the 6th International Conference on Computer and Knowledge Engineering, pp. 292–297, 2016.
- S. K. Sunori, L. Babu, J. Shermina, D. Sam, S. Justin, S. Maurya, and P. Juneja, “Classification of rainfall levels using various machine learning techniques,” in Proceedings of 2021 IEEE Mysore Sub Section International Conference, pp. 1–6, 2021.
- Y. Cheng and G. Church, “Biclustering of expression data,” in Proceedings of International Conference on Intelligent Systems for Molecular Biology, vol. 8, pp. 93–103, 2000.
- S. Bleuler, A. Prelic, and E. Zitzler, “An EA framework for biclustering of gene expression data,” in Proceedings of the 2004 Congress on Evolutionary Computation, vol. 1, pp. 166–173, 2004.
- F. Divina and J. S. Aguilar-Ruiz, “Biclustering of expression data with evolutionary computation,” IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 5, pp. 590–602, 2006.
- Q. Huang, X. Huang, Z. Kong, X. Li, and D. Tao, “Bi-phase evolutionary Searching for biclusters in gene expression data,” IEEE Transactions on Evolutionary Computation, vol. 23, no. 5, pp. 803–814, 2019.
- T. G. Dietterich, “Ensemble methods in machine learning,” in Proceedings of International Workshgp on Multiple Classifier Systems, 2000.
- B. V. Dasarathy and B. V. Sheela, “A composite classifier system design: concepts and methodology,” Proceedings of the IEEE, vol. 67, no. 5, pp. 708–713, 1979.
- R. Polikar, “Ensemble based systems in decision making,” IEEE Circuits and Systems Magazine, vol. 6, no. 3, pp. 21–45, 2006.
- C. A. Lupascu, D. Tegolo, and E. Trucco, “FABC: Retinal vessel segmentation using AdaBoost,” IEEE Transactions on Information Technology in Biomedicine, vol. 14, no. 5, pp. 1267–1274, 2010.
- M. Galar, A. Fernandez, E. Barrenechea, H. Bustince, and F. Herrera, “A review on ensembles for the class imbalance problem: Bagging-, Boosting-, and Hybrid-based approaches,” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, no. 4, pp. 463–484, 2012.
- W. Lin, Z. Wu, L. Lin, A. Wen, and J. Li, “An ensemble random forest algorithm for insurance big data analysis,” IEEE Access, vol. 5, pp. 16568–16575, 2017.
- Y. Liu and Y. Zhou, “An ensemble learning approach for COVID-19 fact verification,” in Proceedings of the 3rd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering, pp. 383–387, 2022.
- Y. Luo, Z. Lu, L. Liu, and Q. Huang, “Deep fusion of human-machine knowledge with attention mechanism for breast cancer diagnosis,” Biomedical Signal Processing and Control, vol. 84, pp. 104784, 2023.
- J. Xi, D. Sun, C. Chang, S. Zhou, and Q. Huang, “An omics-to-omics joint knowledge association subtensor model for radiogenomics cross-modal modules from genomics and ultrasonic images of breast cancers,” Computers in Biology and Medicine, vol. 155, pp. 106672, 2023.
- Q. Huang, H. Luo, C. Yang, J. Li, Q. Deng, P. Liu, M. Fu, L. Li, and X. Li, “Anatomical prior based vertebra modelling for reappearance of human spines,” Neurocomputing, vol. 500, pp. 750–760, 2022.
- G. Li, C. An, J. Yu, and Q. Huang, “Radiomics analysis of ultrasonic image predicts sensitive effects of microwave ablation in treatment of patient with benign breast tumors,” Biomedical Signal Processing and Control, vol. 76, pp. 103722, 2022.
- Y. Luo, Q.Huang, and X. Li, “Segmentation information with attention integration for classification of breast tumor in ultrasound image,” Pattern Recognition, vol. 124, pp. 108427, 2022.
- J. Xi, Z. Miao, L. Liu, X. Yang, W. Zhang, Q. Huang, and X. Li, “Knowledge tensor embedding framework with association enhancement for breast ultrasound diagnosis of limited labeled samples,” Neurocomputing, vol. 468, pp. 60–70, 2022.
- Q. Huang and L. Ye, “Multi-task/Single-task joint learning of ultrasound BI-RADS features,” IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, vol. 69, no. 2, pp. 691–701, 2022.
- Q. Huang, Z. Miao, S. Zhou, C. Chang, and X. Li, “Dense prediction and local fusion of superpixels: A framework for breast anatomy segmentation in ultrasound image with scarce data,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–8, 2021.
- Y. Luo, Q. Huang, and L. Liu, “Classification of tumor in one single ultrasound image via a novel multi-view learning strategy,” Pattern Recognition, vol. 143, pp. 109776, 2023.
- Q. Huang, D. Tao, X. Li, L. Jin, and G. Wei. “Exploiting local coherent patterns for unsupervised feature ranking,” IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 41, no. 6, pp. 1471–1482, 2011.
- Q. Huang, Y. Chen, L. Liu, D. Tao, and X. Li, “On combining biclustering mining and AdaBoost for breast tumor classification,” IEEE Transactions on Knowledge and Data Engineering, vol. 32, no. 4, pp. 728–738, 2020.
- M. Eftekhari, A. Mehrpooya, F. Saberi-Movahed, and V. Torra. “How fuzzy concepts constribute to machine learning,” Springer, 2022.
- H. K. Bhuyan, D. C. Chakraborty, S. K. Pani, and V. Ravi. “Feature and Subfeature Selection for Classification Using Correlation Coefficient and Fuzzy Model,” IEEE Transactions on Engineering Management, vol. 70, no. 5, pp. 1655–1669, 2023.
- P. A. Ejegwa, V. Adah, and I. C. Onyeke. “Some modified Pythagorean fuzzy correlation measures with application in determining some selected decision-making problems,” Granular Computing, vol. 7, pp. 381–391, 2022.
- A. Skowron and C. Rauszer, “The discernibility matrices and functions in information systems,” Intelligent Decision Support, pp. 331–362, 1992.
- T. Yang, T. Li, and G. Lang, “Attribute reduction based on related families of fuzzy covering rough sets,” in Proceedings of 2017 International Conference on Fuzzy Theory and Its Applications, pp. 1–5, 2017.
- L. A. Zadeh, “Fuzzy sets,” Information & Control, vol. 8, no. 4, pp. 338–353, 1965.
- C. Wang, Y. Qian, W. Ding, and X. Fan, “Feature selection with fuzzy-rough minimum classification error criterion,” IEEE Transactions on Fuzzy Systems, vol. 30, no. 8, pp. 2930–2942, 2022.
- A. Kumar and P. S. Prasad, “Scalable fuzzy rough set reduct computation using fuzzy min-max neural network preprocessing,” IEEE Transactions on Fuzzy Systems, vol. 28, no. 5, pp. 953–964, 2020.
- Z. Su, Q. Hu, and T. Denœux, “A distributed rough evidential K-NN classifier: Integrating feature reduction and classification,” IEEE Transactions on Fuzzy Systems, vol. 29, no. 8, pp. 2322–2335, 2021.
- M. Mokhtia, M. Eftekhari, and F. Saberi-Movahed. “Dual-manifold regularized regression models for feature selection based on hesitant fuzzy correlation,” Knowledge-Based Systems, vol. 229, pp. 107308, 2021.