<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1665-6423</journal-id>
<journal-title><![CDATA[Journal of applied research and technology]]></journal-title>
<abbrev-journal-title><![CDATA[J. appl. res. technol]]></abbrev-journal-title>
<issn>1665-6423</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional Autónoma de México, Instituto de Ciencias Aplicadas y Tecnología]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1665-64232020000600376</article-id>
<article-id pub-id-type="doi">10.22201/icat.24486736e.2020.18.6.1364</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Proposal for a KDD-based procedure to obtain a set of intelligent systems training applied to the identification of failures in hydroelectric power plants]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Valencia]]></surname>
<given-names><![CDATA[Andrés M.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Caratar]]></surname>
<given-names><![CDATA[Jesús]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Caicedo]]></surname>
<given-names><![CDATA[Gladys]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Chamorro]]></surname>
<given-names><![CDATA[Cristian]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Universidad del Valle]]></institution>
<addr-line><![CDATA[Cali ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af2">
<institution><![CDATA[Universidad del Valle, Graduate Program of the School of Electrical and Electronic Engineering]]></institution>
<addr-line><![CDATA[Cali ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af3">
<institution><![CDATA[Universidad del Valle, Design Department]]></institution>
<addr-line><![CDATA[Cali ]]></addr-line>
<country>Colombia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>00</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>00</month>
<year>2020</year>
</pub-date>
<volume>18</volume>
<numero>6</numero>
<fpage>376</fpage>
<lpage>389</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1665-64232020000600376&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1665-64232020000600376&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1665-64232020000600376&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: This paper presents a procedure based on KDD (Knowledge Discovery in Databases) that analyzes a data set to obtain structured information about the behavior of a system under specific conditions, such as failure conditions at a hydroelectric power plant. The procedure structures the information obtained so that it can be used to train intelligent systems focused on fault diagnosis. Such a procedure is needed in the development stage of intelligent systems because obtaining an effective training set requires considerable time and effort. The procedure was applied to the historical records of the Amaime hydroelectric power plant, located in Palmira, Valle del Cauca, Colombia, with the aim of obtaining behavior patterns of the protection system that can be associated with different failures. This was made possible by integrating a data mining technique, hierarchical clustering, with a statistical technique, the interpolation function. The main contribution of this work is a structured procedure that reduces the time needed to obtain a training set. In this specific case, a training set for mechanical failures of a hydroelectric power plant was obtained, which can be used in the development of an intelligent system for failure diagnosis.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[knowledge discovery in databases]]></kwd>
<kwd lng="en"><![CDATA[data mining]]></kwd>
<kwd lng="en"><![CDATA[intelligent systems]]></kwd>
<kwd lng="en"><![CDATA[failure diagnosis]]></kwd>
<kwd lng="en"><![CDATA[training set]]></kwd>
<kwd lng="en"><![CDATA[hydroelectric power plant]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Amaya Simeón]]></surname>
<given-names><![CDATA[E. J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Aplicação de Técnicas de Inteligência Artificial no Desenvolvimento de um Sistema de Manutenção Baseada em Condição]]></source>
<year>2008</year>
<publisher-loc><![CDATA[Brasil ]]></publisher-loc>
<publisher-name><![CDATA[Universidade de Brasília]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carreño-Pérez]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Morales-Rivera]]></surname>
<given-names><![CDATA[J. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Rivas-Trujillo]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Redundancy in communication networks for the automation and protection of electrical power systems with IEC 61850]]></article-title>
<source><![CDATA[Información Tecnológica]]></source>
<year>2019</year>
<volume>30</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>75-86</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="">
<collab>Celsia</collab>
<source><![CDATA[Centrales hidroeléctricas]]></source>
<year>2020</year>
</nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cibulková]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname>&#352;ulc</surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Sirota]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname>&#344;ezanková</surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The effect of binary data transformation in categorical data clustering]]></article-title>
<source><![CDATA[Statistics in Transition]]></source>
<year>2019</year>
<volume>20</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>33-47</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dominguez Gavilanes]]></surname>
<given-names><![CDATA[E. X.]]></given-names>
</name>
<name>
<surname><![CDATA[Logroño Vargas]]></surname>
<given-names><![CDATA[D. O.]]></given-names>
</name>
</person-group>
<source><![CDATA[Diseño e Implementación del Control Automático y Monitoreo del Nivel del Embalse en la Central Hidroeléctrica Agoyán]]></source>
<year>2010</year>
<publisher-loc><![CDATA[Quito, Ecuador ]]></publisher-loc>
<publisher-name><![CDATA[Escuela Politécnica Nacional]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dorantes]]></surname>
<given-names><![CDATA[P. N. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[J. P. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Mendez]]></surname>
<given-names><![CDATA[G. M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Fault Detection Systems via a Novel Hybrid Methodology for Fuzzy Logic Systems Based on Individual Base Inference and Statistical Process Control]]></article-title>
<source><![CDATA[IEEE Latin America Transactions]]></source>
<year>2014</year>
<volume>12</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>706-12</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Devore]]></surname>
<given-names><![CDATA[J. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Probability and statistics for engineering and the sciences]]></source>
<year>2016</year>
<edition>Ninth</edition>
<publisher-loc><![CDATA[California ]]></publisher-loc>
<publisher-name><![CDATA[Cengage Learning]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ebtehaj]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Bonakdari]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Zeynoddin]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gharabaghi]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Azari]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Evaluation of preprocessing techniques for improving the accuracy of stochastic rainfall forecast models]]></article-title>
<source><![CDATA[International Journal of Environmental Science and Technology]]></source>
<year>2020</year>
<volume>17</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>505-24</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Efrén]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Alvarado]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Algoritmo neuro-difuso para la detección y clasificación de fallas en líneas de transmisión eléctrica del sistema ecuatoriano usando simulaciones y datos de registradores de fallas]]></source>
<year>2012</year>
<publisher-name><![CDATA[Universidad de Cuenca]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fayyad]]></surname>
<given-names><![CDATA[U. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Piatetsky-Shapiro]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Smyth]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Uthurusamy]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Advances in knowledge discovery and data mining]]></article-title>
<source><![CDATA[American Association for Artificial Intelligence]]></source>
<year>1996</year>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[García]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Molina]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Berlanga]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Patricio]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bustamante]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Padilla]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Ciencia de datos]]></article-title>
<source><![CDATA[Técnicas Analíticas y Aprendizaje Estadístico]]></source>
<year>2018</year>
<publisher-loc><![CDATA[Bogotá, Colombia ]]></publisher-loc>
<publisher-name><![CDATA[Publicaciones Altaria, SL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Han]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Kamber]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Pei]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data mining: concepts and techniques]]></source>
<year>2012</year>
<publisher-loc><![CDATA[Waltham, MA ]]></publisher-loc>
<publisher-name><![CDATA[Morgan Kaufmann Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Morales]]></surname>
<given-names><![CDATA[C. O. H.]]></given-names>
</name>
<name>
<surname><![CDATA[González]]></surname>
<given-names><![CDATA[J. P. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Siller]]></surname>
<given-names><![CDATA[E. G. C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Detección y diagnóstico de fallas en sistemas eléctricos de potencia (SEP) combinando lógica difusa, métricas y una red neuronal probabilística]]></article-title>
<source><![CDATA[Res. Comput. Sci.]]></source>
<year>2014</year>
<volume>72</volume>
<page-range>47-59</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Palacios]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Echeverría]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Barba]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Estudio del Impacto de la Implementación del Sistema de Protección Sistémica en la Operación del Sistema Nacional Interconectado]]></article-title>
<source><![CDATA[Revista Técnica "energía"]]></source>
<year>2016</year>
<volume>12</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>112-20</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Penin]]></surname>
<given-names><![CDATA[A. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sistemas SCADA]]></source>
<year>2007</year>
<edition>2</edition>
<publisher-loc><![CDATA[Barcelona, España ]]></publisher-loc>
<publisher-name><![CDATA[Marcombo, S.A.]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Real]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Vargas]]></surname>
<given-names><![CDATA[J. M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The Probabilistic Basis of Jaccard's Index of Similarity]]></article-title>
<source><![CDATA[Systematic Biology]]></source>
<year>1996</year>
<volume>45</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>380-5</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ristoski]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Paulheim]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Semantic Web in data mining and knowledge discovery: A comprehensive survey]]></article-title>
<source><![CDATA[Journal of Web Semantics]]></source>
<year>2016</year>
<volume>36</volume>
<page-range>1-22</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Osorio]]></surname>
<given-names><![CDATA[J. F. S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Energía hidroeléctrica]]></source>
<year>2008</year>
<volume>139</volume>
<publisher-name><![CDATA[Universidad de Zaragoza]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sarkar]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Sharma]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Baral]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Chatterjee]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Dey]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Chakravorti]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[An expert system approach for transformer insulation diagnosis combining conventional diagnostic tests and PDC, RVM data]]></article-title>
<source><![CDATA[IEEE Transactions on Dielectrics and Electrical Insulation]]></source>
<year>2014</year>
<volume>21</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>882-91</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Soler]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Berroterán]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Gil]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Acosta]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Índice valor de importancia, diversidad y similaridad florística de especies leñosas en tres ecosistemas de los llanos centrales de Venezuela]]></article-title>
<source><![CDATA[Agronomía Trop]]></source>
<year>2012</year>
<volume>62</volume>
<numero>1-4</numero>
<issue>1-4</issue>
<page-range>25-37</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
