MLeNN: A First Approach to Heuristic Multilabel Undersampling

Title: MLeNN: A First Approach to Heuristic Multilabel Undersampling
Publication Type: Conference Paper
Year of Publication: 2014
Authors: Charte, Francisco, Rivera-Rivas A.J., del Jesus M.J., and Herrera F.
Conference Name: 15th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2014
Date Published: 9
Conference Location: Salamanca (Spain)
ISBN Number: 978-3-319-10839-1

Learning from imbalanced multilabel data is a challenging task that has attracted considerable attention lately. Some resampling algorithms used in traditional classification, such as random undersampling and random oversampling, have already been adapted to work with multilabel datasets. This paper proposes MLeNN (MultiLabel edited Nearest Neighbor), a heuristic multilabel undersampling algorithm based on the well-known Wilson's Edited Nearest Neighbor Rule. The samples to be removed are selected heuristically rather than picked at random. The ability of MLeNN to improve classification results is tested experimentally, and its performance is compared against multilabel random undersampling. As will be shown, MLeNN is a competitive multilabel undersampling alternative, able to significantly enhance classification results.
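The abstract does not spell out the selection heuristic, so the following is only a rough sketch of how an Edited Nearest Neighbor style rule might be carried over to multilabel data. It is not the paper's exact procedure. The function name `enn_multilabel_undersample`, the parameters `k` and `threshold`, the Euclidean feature distance, and the plain normalized Hamming distance between label vectors are all assumptions made for illustration.

```python
import numpy as np

def enn_multilabel_undersample(X, Y, k=3, threshold=0.5):
    """Return indices of samples kept by an ENN-style multilabel edit.

    X: (n, d) feature matrix; Y: (n, q) binary label matrix.
    k and threshold are hypothetical parameters, not values from the paper.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=int)
    n = X.shape[0]

    # Pairwise Euclidean distances between samples (assumed metric).
    diffs = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dist, np.inf)  # a sample is never its own neighbor

    keep = []
    for i in range(n):
        neighbors = np.argsort(dist[i])[:k]
        # Fraction of labels on which sample i disagrees with each neighbor
        # (normalized Hamming distance between label vectors).
        label_dist = (Y[neighbors] != Y[i]).mean(axis=1)
        disagreeing = int((label_dist > threshold).sum())
        # Wilson-style rule: drop the sample when at least half of its
        # neighbors carry a clearly different set of labels.
        if disagreeing < len(neighbors) / 2:
            keep.append(i)
    return np.array(keep, dtype=int)
```

Calling `enn_multilabel_undersample(X, Y)` returns the row indices to retain, so the edited dataset is `X[kept], Y[kept]`. A full imbalance-aware method would presumably also restrict removal to samples that carry no minority labels; that step is left out here because the abstract gives no detail on it.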