Resampling Multilabel Datasets by Decoupling Highly Imbalanced Labels

Francisco Charte; A.J. Rivera-Rivas; M. J. del Jesus; F. Herrera

Submitted by fcharte on Thu, 28/02/2019 - 14:06

Title	Resampling Multilabel Datasets by Decoupling Highly Imbalanced Labels
Publication Type	Conference Paper
Year of Publication	2015
Authors	Charte, Francisco, Rivera-Rivas A.J., del Jesus M. J., and Herrera F.
Conference Name	10th International Conference on Hybrid Artificial Intelligent Systems, HAIS 2015
Pagination	489–501
Date Published	6
Conference Location	Bilbao (Spain)
ISBN Number	978-3-319-19643-5
Abstract	Multilabel classification is a task that has been broadly studied in late years. However, how to face learning from imbalanced multilabel datasets (MLDs) has only been addressed latterly. In this regard, a few proposals can be found in the literature, most of them based on resampling techniques adapted from the traditional classification field. The success of these methods varies extraordinarily depending on the traits of the chosen MLDs. One of the characteristics which significantly influences the behavior of multilabel resampling algorithms is the joint appearance of minority and majority labels in the same instances. It was demonstrated that MLDs with a high level of concurrence among imbalanced labels could hardly benefit from resampling methods. This paper proposes an original resampling algorithm, called REMEDIAL, which is not based on removing majority instances nor creating minority ones, but on a procedure to decouple highly imbalanced labels. As will be experimentally demonstrated, this is an interesting approach for certain MLDs.
Notes	TIN2011-28488,TIN2012-33856,P10-TIC-06858,P11-TIC-7765
DOI	10.1007/978-3-319-19644-2_41

Fichero:

2015-HAIS-REMEDIAL.pdf