Preprocessing vague imbalanced datasets and its use in genetic fuzzy classifiers

A. M. Palacios; L. Sánchez; I. Couso

Submitted by fjpr0013 on Tue, 23/04/2019 - 11:17

Title	Preprocessing vague imbalanced datasets and its use in genetic fuzzy classifiers
Publication Type	Conference Paper
Year of Publication	2010
Authors	Palacios, A. M., Sánchez L., and Couso I.
Conference Name	International Conference on Fuzzy Systems
Pagination	1-8
Date Published	July
Keywords	Classification algorithms, Context, data handling, Euclidean distance, fuzzy set theory, Fuzzy systems, genetic algorithms, genetic fuzzy classifier, genetic fuzzy system, Genetics, imbalanced dataset preprocessing, minimum error based classification system, Nearest neighbor searches, objective function, pattern classification, Pediatrics, Training
Abstract	When there is a substantial difference between the number of cases of the majority and minority classes, minimum error-based classification systems tend to overlook these last instances. This can be corrected either by preprocessing the dataset or by altering the objective function of the classifier. In this paper we analyze the first approach, in the context of genetic fuzzy systems (GFS), and in particular of those that can operate with imprecisely observed and low quality data. We will analyze the different preprocessing mechanisms of imbalanced datasets and will show the necessity of extending these for solving those problems where the data is both imprecise and im-balanced. In addition, we include a comprehensive description of a new algorithm, able to preprocess imprecise imbalanced datasets. Several real-world datasets are used to evaluate the proposal.
DOI	10.1109/FUZZY.2010.5584797