SciELO - Scientific Electronic Library Online

 
vol.27 issue1Automatic Recognition of Leukemia AML Using Evolutionary VisionGeneric and Update Multi-Document Text Summarization based on Genetic Algorithm author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

CHAPARRO-AMARO, Óscar Roberto; MARTINEZ-FELIPE, Miguel de Jesús  and  MARTINEZ-CASTRO, Jesús Alberto. Performance of the Classification of Critical Residues at the Interface of BMPs Complexes Pondered with the Ground-State Energy Feature Using Random Forest Classifier. Comp. y Sist. [online]. 2023, vol.27, n.1, pp.257-267.  Epub June 16, 2023. ISSN 2007-9737.  https://doi.org/10.13053/cys-27-1-4537.

This work is focused on implementing and evaluating the Random Forest Classifier (RFC), among other classical machine learning models, on predicting the residues at the interface of protein-protein interactions (PPI) that contribute most of the binding free energy (called hot spots and hot regions). The dataset comprises twenty-nine bone morphogenetic proteins (BMPs) complexes from the Protein Data Bank (PDB). We used just six features such as B-factor, hydrophobicity index, prevalence score, accessible surface area (ASA), conservation score, and the ground-state energy of the amino acids, which were calculated using the Density Functional Theory (DFT). Proving and testing several machine learning methods, we selected the RCF because of its better performance using classical classification metrics and tests. An optimal parameter selection of the RFC reached a better performance using this dataset with around 90 % with the correct class assigned (hot spot & hot region / non-hot spot hot region) residues.

Keywords : Hot spots; hot regions; BMPs; DFT; RFC.

        · text in English     · English ( pdf )