MLeNN: A First Approach to Heuristic Multilabel Undersampling

Author
Abstract
Learning from imbalanced multilabel data is a challenging task that has attracted considerable attention lately. Some resampling algorithms used in traditional classification, such as random undersampling and random oversampling, have been already adapted in order to work with multilabel datasets. In this paper MLeNN (MultiLabel edited Nearest Neighbor), a heuristic multilabel undersampling algorithm based on the well-known Wilson s Edited Nearest Neighbor Rule, is proposed. The samples to be removed are heuristically selected, instead of randomly picked. The ability of MLeNN to improve classification results is experimentally tested, and its performance against multilabel random undersampling is analyzed. As will be shown, MLeNN is a competitive multilabel undersampling alternative, able to enhance significantly classification results.
Year of Publication
2014
Date Published
9
Conference Location
Salamanca (Spain)
ISBN Number
978-3-319-10839-1
DOI
10.1007/978-3-319-10840-7_1
Download citation
Number of Pages
1-9
Bibliography media
Notes

TIN2011-28488,TIN2012-33856,P10-TIC-06858,P11-TIC-9704