SciELO - Scientific Electronic Library Online

 
 issue54IN-DEDUCTIVE and DAG-Tree Approaches for Large-Scale Extreme Multi-label Hierarchical Text ClassificationUnderstanding Human Preferences for Summary Designs in Online Debates Domain author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Polibits

On-line version ISSN 1870-9044

Abstract

ANTON-VARGAS, Jarvin A.; VILLUENDAS-REY, Yenny  and  LOPEZ-YANEZ, Itzamá. Instance Selection to Improve Gamma Classifier. Polibits [online]. 2016, n.54, pp.71-77. ISSN 1870-9044.  http://dx.doi.org/10.17562/PB-54-9.

Pre-processing the dataset is an important stage in the Knowledge Discovery in Datasets (KDD) process. Filtering noise through instance selection is a necessary task. With this, the risk to use misclassified and non-representative instances to train supervised classifiers is reduced. This study aims at improving the performance of the Gamma associative classifier, by introducing a novel similarity function to guide instance selection. The experimental results, over 15 datasets, include several instance selection methods, and their influence in the performance of Gamma classifier is analyzed. The effectiveness of the proposed similarity function is tested, obtaining good results according to classifier accuracy and instance retention ratio.

Keywords : Gamma classifier; instance selection; data pre-processing; similarity functions.

        · text in English     · English ( pdf )