Complementary information for the paper submitted "Improvement of subgroup descriptions in noisy data by detecting exceptions"

Abstract

The presence of noise in datasets to which data mining techniques are applied can greatly reduce the quality and interest of the knowledge extracted. Subgroup discovery is a supervised descriptive rule discovery technique which is not exempt from this
problem. The aim of this paper is to improve the descriptions of subgroups previously obtained by any subgroup discovery algorithm in noisy datasets. This is achieved using the post-processing approach of the MEFES algorithm, that rst detects exceptions in the input subgroups and then includes those exceptions in the descriptions. The experiments performed show the suitability of the proposal to improve the quality of the results.

 

Experimentation

Complete results obtained for the NMEEF-SD algorithm and NMEEF-SD+MEFES:

Complete results obtained for the SDIGA algorithm and SDIGA+MEFES:

Complete results obtained for the AprioriSD algorithm and AprioriSD+MEFES: