SciELO - Scientific Electronic Library Online

 
vol.24 issue3Minimum Cost for Reinforced Concrete Rectangular Beams with Parabolic HaunchesFuzzy Parameter Adaptation in Genetic Algorithms for the Optimization of Fuzzy Integrators in Modular Neural Networks for Multimodal Biometry author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

OUALI, Mohammed; MAHDI, Walid; GHARBAOUI, Radhwane  and  MEDJAHED, Seyyid Ahmed. Controlling 2D Artificial Data Mixtures Overlap. Comp. y Sist. [online]. 2020, vol.24, n.3, pp.1075-1091.  Epub June 09, 2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-24-3-3326.

Clustering methods are used for identifying groups of similar objects considered as homogenous set. Unfortunately, analytic performance evaluation of clustering methods is a difficult task because of their ad-hoc nature. In this paper, we propose a new test case generator of artificial data for 2 dimensional Gaussian mixtures. The proposed generator has two interesting advantages: the first one is its ability to produce simulated mixture for any number of components, while the second one resides in the fact that it formally quantifies the overlap rate which allows us to add some complexity to the data. Clustering algorithms and validity indices behavior is also analyzed by changing the overlap rate between clusters.

Keywords : Clustering algorithms; unsupervised learning; Gaussian mixture; Gaussian components overlap.

        · text in English     · English ( pdf )