|Title||MLeNN: A First Approach to Heuristic Multilabel Undersampling|
|Publication Type||Conference Paper|
|Year of Publication||2014|
|Authors||Charte, Francisco, Rivera Antonio J., del Jesus M. J., and Herrera F.|
|Conference Name||15th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2014|
|Conference Location||Salamanca (Spain)|
Learning from imbalanced multilabel data is a challenging task that has attracted considerable attention lately. Some resampling algorithms used in traditional classification, such as random undersampling and random oversampling, have already been adapted to work with multilabel datasets. This paper proposes MLeNN (MultiLabel edited Nearest Neighbor), a heuristic multilabel undersampling algorithm based on Wilson's well-known Edited Nearest Neighbor rule. The samples to be removed are selected heuristically rather than picked at random. The ability of MLeNN to improve classification results is tested experimentally, and its performance is compared against multilabel random undersampling. As will be shown, MLeNN is a competitive multilabel undersampling alternative, able to significantly enhance classification results.
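For readers unfamiliar with the rule MLeNN builds on, the following is a minimal sketch of the classic single-label Edited Nearest Neighbor rule: a sample is dropped when its class disagrees with the majority class among its k nearest neighbors. It is not the MLeNN algorithm itself, whose multilabel adaptation is described in the paper and not in this abstract. The function name `edited_nearest_neighbor`, the default `k=3`, and the use of Euclidean distance are assumptions made for illustration.

```python
import math
from collections import Counter


def edited_nearest_neighbor(points, labels, k=3):
    """Return the indices of the samples kept by the single-label ENN rule.

    Illustrative sketch only: a sample is kept when its class matches the
    majority class among its k nearest neighbors (Euclidean distance).
    """
    keep = []
    for i, p in enumerate(points):
        # Distance from sample i to every other sample, nearest first.
        dists = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        neighbor_labels = [labels[j] for _, j in dists[:k]]
        majority, _ = Counter(neighbor_labels).most_common(1)[0]
        if labels[i] == majority:
            keep.append(i)
    return keep


# Toy data: one "b" sample sits in the middle of a cluster of "a" samples.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
          (10, 10), (10, 11), (11, 10), (11, 11)]
labels = ["a", "a", "a", "a", "b", "b", "b", "b", "b"]
kept = edited_nearest_neighbor(points, labels, k=3)
```

In this toy dataset the sample at index 4 is the only one whose neighbors all carry a different class, so it is the only one removed. In a multilabel setting each sample carries a set of labels, so a single "class disagrees" test no longer applies directly; how MLeNN selects candidates and compares label sets is specified in the paper.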