In the context of multi-label learning, feature selection (MLFS) is a key process for handling high-dimensional datasets, aiming to retain the most informative features while preserving inter-label relationships. This study presents an extensive overview of state-of-the-art MLFS approaches founded on principles from information theory. The paper first introduces the fundamental concepts of information theory, then provides a detailed review of representative MLFS methods along with their theoretical background. Performance assessments are carried out on real-world multi-label datasets, allowing us to highlight the advantages and shortcomings of each method. For comparison, we employ widely used evaluation metrics such as Hamming Loss, Accuracy, Label Ranking Loss, and F1-score. We also outline future research perspectives, including the design of a new feature relevance criterion that integrates label importance weighting and redundancy reduction based on label dependencies, with the aim of enhancing both feature selection and multi-label classification accuracy.
- Chamlal H., Aaboub F., Ouaderhman T. A preordonance-based decision tree method and its parallel implementation in the framework of Map-Reduce. Applied Soft Computing. 167 (A), 112261 (2024).
- Chamlal H., Rebbah F. E., Ouaderhman T. An ensemble classifier combining Dempster–Shafer theory and feature selection methods aggregation strategy. Applied Soft Computing. 180, 113306 (2025).
- Cover T. M., Thomas J. A. Elements of Information Theory. John Wiley & Sons, Inc. (2006).
- Kashef S., Nezamabadi-pour H., Nikpour B. Multilabel feature selection: A comprehensive review and guiding experiments. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. 8 (2), e1240 (2018).
- Zhang M.-L., Li Y.-K., Liu X.-Y., Geng X. Binary relevance for multi-label learning: an overview. Frontiers of Computer Science. 12, 191–272 (2018).
- Kamalov F., Sulieman H., Alzaatreh A., Emarly M., Chamlal H., Safaraliev M. Mathematical Methods in Feature Selection: A Review. Mathematics. 13 (6), 996 (2025).
- Peng H., Long F., Ding C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence. 27 (8), 1226–1238 (2005).
- Chamlal H., Ouaderhman T., Aaboub F. Preordonance correlation filter for feature selection in the high dimensional classification problem. 2021 Fifth International Conference On Intelligent Computing in Data Sciences (ICDS). 1–5 (2021).
- Chamlal H., Ouaderhman T., Rebbah F. E. A novel filter based feature selection approach for microarray dataset. 2021 Fifth International Conference On Intelligent Computing in Data Sciences (ICDS). 1–6 (2021).
- Spolaôr N., Cherman E. A., Monard M. C., Lee H. D. A comparison of multi-label feature selection methods using the problem transformation approach. Electronic Notes in Theoretical Computer Science. 292, 135–171 (2013).
- Read J. A pruned problem transformation method for multi-label classification. Proceedings of New Zealand Computer Science Research Student Conference, NZCSRSC 2008. 143–150 (2008).
- Lee J.-S., Kim D.-W. Mutual information-based multi-label feature selection using interaction information. Expert Systems with Applications. 42 (4), 2013–2725 (2015).
- Zhang P., Liu G., Gao W. Distinguishing two types of labels for multi-label feature selection. Pattern Recognition. 95, 72–82 (2019).
- Zhang P., Liu G., Gao W., Song J. Multi-label feature selection considering label supplementation. Pattern Recognition. 120, 108137 (2021).
- Lee J., Kim D.-W. Feature selection for multi-label classification using multivariate mutual information. Pattern Recognition Letters. 34 (3), 349–377 (2013).
- Lee J., Kim D.-W. SCLS: Multi-label feature selection based on scalable criterion for large label set. Pattern Recognition. 66, 342–372 (2017).
- Ma X.-A., Liu H., Liu Y., Zhang J. Z. Multi-label feature selection considering label importance-weighted relevance and label-dependency redundancy. European Journal of Operational Research. 322 (1), 215–236 (2025).
- Han Q., Zhao Z., Hu L., Gao W. Enhanced multi-label feature selection considering label-specific relevant information. Expert Systems with Applications. 264, 125819 (2025).
- Zhang S., Li Y., Zhang P., Gao W. Exploring multi-label feature selection via feature and label information supplementation. Engineering Applications of Artificial Intelligence. 159 (A), 111552 (2025).
- Pereira R. B., Plastino A., Zadrozny B., Merschmann L. H. C. Correlation analysis of performance measures for multi-label classification. Information Processing & Management. 54 (3), 359–369 (2018).