Open Continual Feature Selection via Granular-Ball Knowledge Transfer (2403.10253v1)
Abstract: This paper presents a novel framework for continual feature selection (CFS) in data preprocessing, targeting open and dynamic environments where unknown classes may emerge. CFS faces two primary challenges: discovering unknown knowledge and transferring known knowledge. To address them, the proposed CFS method combines continual learning (CL) with granular-ball computing (GBC): it constructs a granular-ball knowledge base to detect unknown classes and to transfer previously learned knowledge to subsequent feature selection. The method consists of two stages, initial learning and open learning. The former establishes an initial knowledge base through multi-granularity representation using granular-balls. The latter uses prior granular-ball knowledge to identify unknowns, updates the knowledge base for granular-ball knowledge transfer, reinforces old knowledge, and integrates new knowledge. We then devise an optimal feature subset mechanism that adds a minimal number of new features to the existing optimal subset, often yielding superior results in each period. Extensive experiments on public benchmark datasets demonstrate the method's superiority in both effectiveness and efficiency compared with state-of-the-art feature selection methods.
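The two stages described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes one granular-ball per class (center = class mean, radius = largest distance to the center), whereas granular-ball computing normally splits balls adaptively into multiple granularities. The helper names `build_balls` and `is_unknown`, the toy data, and the choice of `2` as the new class label are all hypothetical.

```python
import numpy as np

def build_balls(X, y):
    """Build one ball (center, radius, label) per class.
    Simplifying assumption: real GBC generates many balls per class."""
    balls = []
    for label in np.unique(y):
        pts = X[y == label]
        center = pts.mean(axis=0)
        radius = np.linalg.norm(pts - center, axis=1).max()
        balls.append((center, radius, label))
    return balls

def is_unknown(x, balls):
    """Flag a sample as unknown if it lies outside every stored ball."""
    return all(np.linalg.norm(x - c) > r for c, r, _ in balls)

# Stage 1, initial learning: build the knowledge base from labeled data.
X0 = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [1.1, 0.9]])
y0 = np.array([0, 0, 1, 1])
kb = build_balls(X0, y0)

# Stage 2, open learning: use prior balls to identify unknown samples.
X1 = np.array([[0.1, 0.0], [5.0, 5.0], [5.1, 4.9]])
unknown_mask = np.array([is_unknown(x, kb) for x in X1])

# Integrate new knowledge: add a ball for the unknown samples
# under a hypothetical new label 2, keeping the old balls intact.
X_new = X1[unknown_mask]
kb += build_balls(X_new, np.full(len(X_new), 2))
```

In this toy run the first new sample falls inside the class-0 ball, while the other two lie outside every ball and are grouped into a new ball. Feature selection over the updated knowledge base is omitted, since the abstract does not specify the evaluation criterion.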