Computación y Sistemas
Online version ISSN 2007-9737; print version ISSN 1405-5546
Abstract
GARATE-ESCAMILLA, Anna Karen; ORTIZ-BAYLISS, José Carlos and TERASHIMA-MARIN, Hugo. Machine Learning, Missing Values, and Algorithm Selectors: The Untold Story. Comp. y Sist. [online]. 2025, vol.29, n.1, pp.311-323. Epub 05-Dec-2025. ISSN 2007-9737. https://doi.org/10.13053/cys-29-1-5508.
This paper presents a study of the potential benefits of incorporating missing values into the training process of algorithm selectors powered by machine learning algorithms, particularly those used for classification. This work analyzes various scenarios related to omitting some of the data available for training and measures the performance of the algorithm selectors produced to estimate how resistant they are to the presence of missing values within the training data. Our experiments open a new and exciting perspective on training algorithm selectors, one where it is possible to save computational resources by omitting some calculations, reducing the effort to produce such selectors, but without significantly harming their performance on unseen instances. For example, our results show that given a proper training set and deciding which runs to omit completely at random, some Machine Learning strategies such as Neural Networks, Naïve Bayes Classifiers, and Support Vector Machines can correctly operate as algorithm selectors with up to 50% of the data missing (data about the solvers to choose from), without any further treatment of the missing values.
Keywords: Algorithm selection; bin packing problem; machine learning; missing values.
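
As a rough illustration of the scenario described in the abstract, the following Python sketch omits solver runs completely at random from a performance table and derives training labels from the runs that remain. It is a minimal sketch under stated assumptions, not the authors' procedure: the function names `mask_runs_mcar` and `labels_from_observed`, the synthetic cost matrix, the convention that lower cost is better, and the rule of keeping at least one run per instance are all hypothetical choices made here for illustration.

```python
import numpy as np


def mask_runs_mcar(perf, missing_rate, rng):
    """Return a copy of `perf` (instances x solvers) with runs omitted at random.

    Each entry is dropped independently with probability `missing_rate`.
    Assumption (not from the paper): one randomly chosen run per instance
    is always kept, so every instance can still receive a label.
    """
    perf = np.asarray(perf, dtype=float)
    masked = perf.copy()
    drop = rng.random(perf.shape) < missing_rate
    # Force one observed solver per instance.
    keep_col = rng.integers(perf.shape[1], size=perf.shape[0])
    drop[np.arange(perf.shape[0]), keep_col] = False
    masked[drop] = np.nan  # NaN marks a run that was never computed
    return masked


def labels_from_observed(masked):
    """Label each instance with the index of its best observed solver.

    Assumes lower values are better (e.g. a cost or a waste measure).
    """
    return np.nanargmin(masked, axis=1)


# Synthetic example: 200 instances, 4 hypothetical solvers.
rng = np.random.default_rng(0)
perf = rng.random((200, 4))
masked = mask_runs_mcar(perf, missing_rate=0.5, rng=rng)
labels = labels_from_observed(masked)
true_best = perf.argmin(axis=1)
agreement = float((labels == true_best).mean())
print(f"fraction of runs omitted: {np.isnan(masked).mean():.2f}")
print(f"labels matching the full-data best solver: {agreement:.2f}")
```

The resulting `labels`, paired with instance features, could then be passed to any classifier to train a selector. The agreement figure printed here only measures how often the label survives the omission on synthetic data; it says nothing about the results reported in the paper.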