Concurrence among Imbalanced Labels and Its Influence on Multilabel Resampling Algorithms

TitleConcurrence among Imbalanced Labels and Its Influence on Multilabel Resampling Algorithms
Publication TypeConference Paper
Year of Publication2014
AuthorsCharte, Francisco, Rivera-Rivas A.J., del Jesus M. J., and Herrera F.
Conference Name9th International Conference on Hybrid Artificial Intelligent Systems (HAIS 2014)
Pagination110–121
Date Published6
Conference LocationSalamanca (Spain)
ISBN Number978-3-319-07616-4
Abstract

In the context of multilabel classification, the learning from imbalanced data is getting considerable attention recently. Several algorithms to face this problem have been proposed in the late five years, as well as various measures to assess the imbalance level. Some of the proposed methods are based on resampling techniques, a very well-known approach whose utility in traditional classification has been proven. This paper aims to describe how a specific characteristic of multilabel datasets (MLDs), the level of concurrence among imbalanced labels, could have a great impact in resampling algorithms behavior. Towards this goal, a measure named SCUMBLE, designed to evaluate this concurrence level, is proposed and its usefulness is experimentally tested. As a result, a straightforward guideline on the effectiveness of multilabel resampling algorithms depending on MLDs characteristics can be inferred.

Notes

TIN2011-28488,TIN2012-33856,P10-TIC-06858,P11-TIC-9704

DOI10.1007/978-3-319-07617-1_10