Abstract
This work presents a multilabel feature selection approach via label enhancement and weighted neighborhood mutual information. First, the Fuzzy C-Means (FCM) clustering is optimized by the Whale Optimization Algorithm (WOA) to obtain the initial value of the cluster centers, and then in the iterative process, the FCM clustering algorithm is updated to ensure fast convergence and avoid local optimization. Secondly, the association matrix is constructed through the membership degree of each sample obtained by the FCM clustering, and a fuzzy synthesis operation is performed to obtain the label enhancement strategy. Thirdly, label weights are introduced into the traditional neighborhood mutual information to improve the handling effect of imbalanced labels. Feature weights are calculated via the maximum information coefficient to determine the weighted sample neighborhoods, and this weighted neighborhood mutual information assesses redundancy between the candidate and selected features. Finally, a feature selection algorithm via weighted neighborhood mutual information is designed for multilabel data classification. Those comparative experiments are performed on 11 multilabel datasets. The experimental results show that the constructed algorithm effectively improves the classification effect of multilabel datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Disclosure of Interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work.
References
Sun, L., Ma, Y.X., Ding, W.P., Lu, Z.H., Xu, J.C.: LSFSR: local label correlation-based sparse multilabel feature selection with feature redundancy. Inf. Sci. 667, 120501 (2024)
Zhao, K.D., Ya, P., Jia, Z.Y., Ji, Y.: General fuzzy C-means clustering algorithm using Minkowski metric. Signal Process. 188, 108161 (2021)
Chen, Y., Chen, Y.Y., Hou, X.Y., Jiang, L.J., Liao, L.: A neighborhood granule fuzzy C-means clustering algorithm. J. Shandong Univ. (Nat. Sci.) 59(3), 1–10 (2024)
Verma, H., Verma, D., Tiwari, P.K.: A population based hybrid FCM-PSO algorithm for clustering analysis and segmentation of brain image. Expert Syst. Appl. 167, 114121 (2021)
Tongbram, S., Shimray, B.A., Singh, L.S., Dhanachandra, N.: A novel image segmentation approach using FCM and whale optimization algorithm. J. Ambient Intell. Humaniz. Comput. (2021). https://doi.org/10.1007/s12652-020-02762-w
Geng, X.: Label distribution learning. IEEE Trans. Knowl. Data Eng. 28(7), 1734–1748 (2016)
Fan, Y., Liu, J., Tang, J., Liu, P., Lin, Y., Du, Y.: Learning correlation information for multilabel feature selection. Pattern Recognit. 145, 109899 (2024)
Dai, J., Huang, W., Zhang, C., Liu, J.: Multilabel feature selection by strongly relevant label gain and label mutual aid. Pattern Recognit. 145, 109945 (2024)
Lu, Y., Li, W., Li, H., Jia, X.Y.: Ranking-preserved generative label enhancement. Mach. Learn. 112, 4693–4721 (2023)
Lee, J., Kim, D.W.: Feature selection for multilabel classification using multivariate mutual information. Pattern Recognit. Lett. 34(3), 349–357 (2013)
Shi, E., Sun, L., Xu, J.C., Zhang, S.G.: Multilabel feature selection using mutual information and ML-ReliefF for multilabel classification. IEEE Access 8, 145381–145400 (2020)
Liu, J.H., Lin, Y.J., Ding, W.P., Zhang, H.B., Du, J.X.: Fuzzy Mutual information-based multilabel feature selection with label dependency and streaming labels. IEEE Trans. Fuzzy Syst. 31(1), 77–91 (2023)
Geng, X., Xu, N.: Label distribution learning and label enhancement. Sci. China Inf. Sci. 48(5), 521–530 (2018)
Liu, Y., Chen, H., Li, T., Li, W.: A robust graph based multilabel feature selection considering feature-label dependency. Appl. Intell. 53, 837–863 (2022)
Sun, L., Huang, M.M., Xu, J.C.: Weak label feature selection method based on neighborhood rough sets and Relief. Chin. Comput. Sci. 49(4), 152–160 (2022)
Zhang, J., et al.: Group-preserving label-specific feature selection for multilabel learning. Expert Syst. Appl. 213, 118861 (2022)
Liang, Y., Gan, J., Chen, Y., Zhou, P., Du, L.: Unsupervised feature selection algorithm based on dual manifold re-ranking. Chin. Comput. Sci. 50(7), 72–81 (2023)
Hashemi, A., Dowlatshahi, M.B., Nezamabadi-pour, H.: An efficient Pareto-based feature selection algorithm for multilabel classification. Inf. Sci. 581, 428–447 (2021)
Huang, R., Jiang, W., Sun, G.: Manifold-based constraint Laplacian score for multilabel feature selection. Pattern Recognit. Lett. 112, 346–352 (2018)
Zhang, J., et al.: Fast multilabel feature selection via global relevance and redundancy optimization. IEEE Trans. Neural Netw. Learn. Syst. 35(4), 5721–5734 (2022)
Gonzalez-Lopez, J., Ventura, S., Cano, A.: Distributed multilabel feature selection using individual mutual information measures. Knowl. Based Syst. 188, 105052 (2020)
Chen, L.L., Chen, D.G.: Alignment based feature selection for multilabel learning. Neural. Process. Lett. 50, 2323–2344 (2019)
Acknowledgments
This research was funded by the National Natural Science Foundation of China under Grants 62076089 and 61772176.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sun, L., Guo, J., Wu, X., Xu, J. (2024). Feature Selection via Label Enhancement and Weighted Neighborhood Mutual Information for Multilabel Data. In: Huang, DS., Zhang, C., Pan, Y. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science(), vol 14876. Springer, Singapore. https://doi.org/10.1007/978-981-97-5666-7_40
Download citation
DOI: https://doi.org/10.1007/978-981-97-5666-7_40
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5665-0
Online ISBN: 978-981-97-5666-7
eBook Packages: Computer ScienceComputer Science (R0)