<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>0188-9532</journal-id>
<journal-title><![CDATA[Revista mexicana de ingeniería biomédica]]></journal-title>
<abbrev-journal-title><![CDATA[Rev. mex. ing. bioméd]]></abbrev-journal-title>
<issn>0188-9532</issn>
<publisher>
<publisher-name><![CDATA[Sociedad Mexicana de Ingeniería Biomédica]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S0188-95322011000200004</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Identification of functional sequences using associative memories]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Román-Godínez]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Garibay-Orijel]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Yáñez-Márquez]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Instituto Politécnico Nacional Centro de Investigación en Computación Laboratorio de Redes Neuronales y Computo no Convencional]]></institution>
<addr-line><![CDATA[México Distrito Federal]]></addr-line>
<country>México</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Instituto Politécnico Nacional Unidad Profesional Interdisciplinaria de Biotecnología Laboratorio de Bioingeniería]]></institution>
<addr-line><![CDATA[México Distrito Federal]]></addr-line>
<country>México</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2011</year>
</pub-date>
<volume>32</volume>
<numero>2</numero>
<fpage>109</fpage>
<lpage>118</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S0188-95322011000200004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S0188-95322011000200004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S0188-95322011000200004&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[The identification and discrimination of functional sequences or mutations is very helpful in the medical area. Promoter and splice-junction identification, gene finding, DNA or Aminoacid database searching are some examples. Pattern recognition algorithms are candidates to perform this tasks. In this work we present a model, based on AlphaBeta associative memory and NeedlemanWunsch algorithm, to correctly recall altered version of learning patterns with one or more of the following modifications: insertions, deletions, and mutations, very common alterations in DNA and Aminoacid sequences. Moreover, this model preserve one of the most important advantages in associative memories, the correct recall of the fundamental set. To test the performance of the algorithm on bioinformatics and biomedical applications, the model presented here was tested using two datasets; one from the UCI repository; refered to promoter identification and the second one to using the genome of the Variovoraxparadoxus organism obtained from the NCBI repository.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[La identificación y discriminación de secuencias funcionales son de mucha ayuda en la investigación en el área biomédica. Identificación de promotores, identificación de zonas de empalme, búsqueda de genes y búsqueda de secuencias de ADN y aminoácidos en bases de datos son algunos ejemplos de aplicaciones en dicha área de investigación. Dada la naturaleza del problema, los algoritmos de reconocimiento de patrones son candidatos naturales para llevar a cabo las tareas antes mencionadas. En el presente trabajo se propone un nuevo modelo de memorias asociativas Alfa-Beta, basadas en el modelo original de memorias y el algoritmo global de alineamiento de secuencias desarrollado por Needleman-Wunsch, que permiten la recuperación de patrones alterados con respecto de los patrones de aprendizaje con alguna de las siguientes alteraciones: mutaciones, inserciones y borrados; alteraciones comunes en secuencias de DNA y aminoácidos. El presente modelo preserva una de las más importantes ventajas en memorias asociativas, la recuperación completa del conjunto fundamental. Para probar el desempeño del modelo en aplicaciones tanto de bioinformática como biomédica, se utilizaron dos bases de datos; una obtenida del repositorio de la Universidad de California en Irvine; sobre secuencias que contienen promotores y la segunda del genoma del organismo Variovorax paradoxus obtenida del repositorio de la NCBI.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Promoters]]></kwd>
<kwd lng="en"><![CDATA[aminoacid sequences retrieval]]></kwd>
<kwd lng="en"><![CDATA[DNA sequence retrieval]]></kwd>
<kwd lng="en"><![CDATA[associative memories]]></kwd>
<kwd lng="en"><![CDATA[biomedical]]></kwd>
<kwd lng="en"><![CDATA[bioinformatics]]></kwd>
<kwd lng="en"><![CDATA[Variovorax paradoxus]]></kwd>
<kwd lng="es"><![CDATA[Promotores]]></kwd>
<kwd lng="es"><![CDATA[recuperación de secuencias de aminoácidos]]></kwd>
<kwd lng="es"><![CDATA[recuperación de secuencias de ADN]]></kwd>
<kwd lng="es"><![CDATA[memorias asociativas]]></kwd>
<kwd lng="es"><![CDATA[bioinformática]]></kwd>
<kwd lng="es"><![CDATA[Variovorax paradoxus]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Art&iacute;culo de investigaci&oacute;n original</font></p>     <p align="justify">&nbsp;</p>      <p align="center"><font face="verdana" size="4"><b>Identification of functional sequences using associative memories</b></font></p>     <p align="center">&nbsp;</p>      <p align="center"><b><font face="verdana" size="2">Rom&aacute;n&#45;God&iacute;nez I*, Garibay&#45;Orijel C**, Y&aacute;&ntilde;ez&#45;M&aacute;rquez C***</font></b><font face="verdana" size="2"></font></p>     <p align="center">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><i>* Laboratorio de Redes Neuronales y Computo no Convencional, Centro de Investigaci&oacute;n en Computaci&oacute;n, Instituto Polit&eacute;cnico Nacional, M&eacute;xico, D.F. </i></font></p>     <p align="justify"><i><font face="verdana" size="2">** Laboratorio de Bioingenier&iacute;a, Unidad Profesional Interdisciplinaria de Biotecnolog&iacute;a, Instituto Polit&eacute;cnico Nacional, M&eacute;xico, D.F.</font></i><font face="verdana" size="2"></font></p>     <p align="justify">&nbsp;</p>      ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Correspondence: </b>    <br> </font><font face="verdana" size="2">Israel Rom&aacute;n God&iacute;nez.    <br> </font><font face="verdana" size="2">Av. Juan de Dios B&aacute;tiz s/n casi esquina Miguel     <br> Oth&oacute;n de Mendiz&aacute;bal, Unidad Profesional     <br> &laquo;Adolfo L&oacute;pez Mateos&raquo;, Edificio CIC,     <br> Col. Nueva Industrial Vallejo, 07738, Del.     <br> Gustavo A. Madero, M&eacute;xico, D.F.     <br> E&#45;mail: <a href="mailto:israelromang@hotmail.com">israelromang@hotmail.com</a></font></p>     <p align="justify">&nbsp;</p>     <p align="justify"><font face="verdana" size="2">Received article: 06/junio/2011.    ]]></body>
<body><![CDATA[<br> </font><font face="verdana" size="2"> Accepted article: 09/septiembre/2011.</font></p>     <p align="justify">&nbsp;</p>  	    <p align="justify"><font face="verdana" size="2"><b>ABSTRACT</b></font></p>  	    <p align="justify"><font face="verdana" size="2">The identification and discrimination of functional sequences or mutations is very helpful in the medical area. Promoter and splice&#45;junction identification, gene finding, DNA or Aminoacid database searching are some examples. Pattern recognition algorithms are candidates to perform this tasks. In this work we present a model, based on AlphaBeta associative memory and NeedlemanWunsch algorithm, to correctly recall altered version of learning patterns with one or more of the following modifications: insertions, deletions, and mutations, very common alterations in DNA and Aminoacid sequences. Moreover, this model preserve one of the most important advantages in associative memories, the correct recall of the fundamental set. To test the performance of the algorithm on bioinformatics and biomedical applications, the model presented here was tested using two datasets; one from the UCI repository; refered to promoter identification and the second one to using the genome of the <i>Variovoraxparadoxus</i> organism obtained from the NCBI repository.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Key Words:</b> Promoters, aminoacid sequences retrieval, DNA sequence retrieval, associative memories, biomedical, bioinformatics, Variovorax paradoxus.</font></p>     <p align="justify">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><b>RESUMEN</b></font></p>  	    <p align="justify"><font face="verdana" size="2">La identificaci&oacute;n y discriminaci&oacute;n de secuencias funcionales son de mucha ayuda en la investigaci&oacute;n en el &aacute;rea biom&eacute;dica. Identificaci&oacute;n de promotores, identificaci&oacute;n de zonas de empalme, b&uacute;squeda de genes y b&uacute;squeda de secuencias de ADN y amino&aacute;cidos en bases de datos son algunos ejemplos de aplicaciones en dicha &aacute;rea de investigaci&oacute;n. Dada la naturaleza del problema, los algoritmos de reconocimiento de patrones son candidatos naturales para llevar a cabo las tareas antes mencionadas. En el presente trabajo se propone un nuevo modelo de memorias asociativas Alfa&#45;Beta, basadas en el modelo original de memorias y el algoritmo global de alineamiento de secuencias desarrollado por Needleman&#45;Wunsch, que permiten la recuperaci&oacute;n de patrones alterados con respecto de los patrones de aprendizaje con alguna de las siguientes alteraciones: mutaciones, inserciones y borrados; alteraciones comunes en secuencias de DNA y amino&aacute;cidos. El presente modelo preserva una de las m&aacute;s importantes ventajas en memorias asociativas, la recuperaci&oacute;n completa del conjunto fundamental. Para probar el desempe&ntilde;o del modelo en aplicaciones tanto de bioinform&aacute;tica como biom&eacute;dica, se utilizaron dos bases de datos; una obtenida del repositorio de la Universidad de California en Irvine; sobre secuencias que contienen promotores y la </font><font face="verdana" size="2">segunda del genoma del organismo <i>Variovorax paradoxus</i> obtenida del repositorio de la NCBI.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Palabras clave:</b> Promotores, recuperaci&oacute;n de secuencias de amino&aacute;cidos, recuperaci&oacute;n de secuencias de ADN, memorias asociativas, bioinform&aacute;tica, Variovorax paradoxus.</font></p>     <p align="justify">&nbsp;</p>      ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>INTRODUCTION</b></font></p>  	    <p align="justify"><font face="verdana" size="2">In the later decades, very important scientific advances in the field of molecular biology have been achieved. Thanks to the enormous ammounts of information derived from these advances, there has arisen a need to process such information in a faster manner and just as effectively, or more, than by an expert. This gives birth to a new branch of science, known as <i>Bioinformatics:</i> a multidisciplinary field which combines, among others, two important fields of science, <i>molecular biology</i> and <i>computer sciences<sup>1</sup>.</i> Its main objective is to apply the advantages offered by computer sciences to molecular biology, given the fast development, efficiency and efficacy of the algorithms of the former<sup>2</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2">Among the first and foremost problems boarded by Bioinformatics are: the development of databases, protein sequence alignment, DNA string sequencing, protein structure prediction, protein structure classification, promoter identification, splice&#45;junction zone localization, and filogenetic relationships determining<sup>3,4</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2"><i>Deoxyribonucleic acid (DNA)</i> and <i>proteins</i> are biological macromolecules made up of long chains of chemical components. On one hand, DNA is made up of <i>nucleotides,</i> of which there are four: <i>adenine</i> (A), <i>cytosine</i> (C), <i>guanine</i> (G), and <i>thymine</i> (T), denoted by their initials. Also, DNA plays a fundamental role in different biochemical processes of living organisms, such as protein synthesis and hereditary information transmission from parents to children<sup>5</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2">Promoters are the regions in the DNA that regulates the expression of the proteins and are regularly before each gene<sup>6&#45;8</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2">On the other hand, proteins are polypeptides formed inside cells as sequences of 20 different <i>aminoacids<sup>9</sup>,</i> which are denoted by 20 different letters. Each of these 20 aminoacids is coded by one or more <i>codons<sup>5</sup>.</i> The chemical properties differentiating the 20 aminoacids make them group together to conform proteins with certain tridimentional structures, defining the specific functions of </font><font face="verdana" size="2">a cell<sup>8</sup>.</font></p>     <p align="justify"><font face="verdana" size="2">Several diseases are generated by point mutations, insertions or deletions in the DNA. Thus, pattern recognition plays an important role in medical area. The recognition of the mutations leads to a better understanding of the disease and the development of new techniques and equipment. An example was the Influenza outbreak in M&eacute;xico City, where the use of Bioinformatics gave important information about the pandemia, and lead to the development of different techniques and equipment for its identification and vaccination<sup>10,11</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2">The topic of associative memories has been an active field of scientific research for some decades, attracting the attention in some research areas for the great power they offer despite the simplicity of its algorithms. The most important characteristic and, at the same time, fundamental purpose of an associative memory, is to correctly recall whole output patterns from input patterns, with the possibility of having the latter altered, either by an additive, subtractive, or mixed alteration<sup>12&#45;14</sup>. This kind of alterations could be classified as mutation based on their definition, but the insertions and deletions was not treatable by this algorithm.</font></p>  	    <p align="justify"><font face="verdana" size="2">An associative memory has two phases: the learning phase, which is the process that allows the memory to be built by learning associations of patterns, and the recalling phase, in which the memory is presented with input patterns that can be present in the fundamental set or not, and the memory output, the corresponding associated pattern, according to the associations learned<sup>13,14</sup>.</font></p>  	    <p align="justify"><font face="verdana" size="2">Associative memories, and specifically alpha&#45;beta associative memories, are a powerful computational tool in pattern recognition due to the simplicity of the iralgorithms, their strong mathematical foundation, and the high efficacy shown by them in pattern recalling and classification<sup>15</sup>.</font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">In this paper we propose the use of a robust alpha&#45;beta associative memory model for retrieval of sequences from a Aminoacid&#45;database. This model has the capability of managing learning and recalling patterns of different dimensions. Moreover, this model can handle insertion, deletion and mutation in sequences, keeping its capability of complete </font><font face="verdana" size="2">recall of the fundamental set. These characteristics are not present in the models found in our research of the state&#45;of&#45;art.</font></p>     <p align="justify"><font face="verdana" size="2">This paper is organized as follows. <i>Tools</i> is focused on explaining the Alpha&#45;Beta heteroassociative memory model which is the main tool of this paper. <i>Robust Retrieval Associative Memory</i> contains the core proposal and its theoretical support. <i>Results</i> is devoted to the experimental results and finally, <i>Conclusion and Future work</i> addresses thoughts derived from this work.</font></p>     <p align="justify">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><b>TOOLS</b></font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Alpha&#45;Beta associative memories</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Here we introduce the basic notation of associative memories as presented in<sup>15</sup>. An associative memory <b>M</b> is a system that relates input and outputs patterns. Each input vector <b>x</b> forms an association with a corresponding output vector <b>y</b>. The <i>k</i> &#45;th association will be denoted as (<b><i>x<sup>k</sup>,y<sup>k</sup></i></b>). Associative memory <b>M</b> is represented by a matrix whose component in the <i>i&#45;</i>th row and <i>j</i>&#45;th column is denoted <i>m<sub>ij</sub>.</i> The <i>m<sub>ij</sub></i> generated from a set of <i>a priori</i> known associations, called the fundamental set. The fundamental set is defined as follows: <i>{(x<sup>&micro;</sup>,y<sup>&micro;</sup>)&#124;&micro; = 1,2,...,p}</i> where <i>p</i> the cardinality of the fundamental set. The patterns that belong to the fundamental set are called fundamental patterns. If it holds that x<i><sup>&micro;</sup></i> = y<i><sup>&micro;</sup></i> &#8704;<i>&micro;</i>&#8712;<i>{1,2,...,p},</i> then <b>M</b> is <i>autoassociative,</i> otherwise it is <i>heteroassociative.</i> In this latter case it is possible to establish that &#8707;<i>&micro;</i>&#8712;<i>{1,2,...,p}</i> for which <i>x<sup>&micro;</sup></i> &#8800; y<i><sup>&micro;</sup></i>. When feeding an unknown fundamental pattern x<sup><i>&omega;</i></sup> with <i>&omega;</i>&#8712;{1,2,...,p} to an associative memory <b>M</b>, it happens that the output corresponds exactly to the associated pattern y<sup><i>&omega;</i></sup> it is said that recall is correct.</font></p>      <p align="justify"><font face="verdana" size="2">The heart of the mathematical tools used in the Alpha&#45;Beta model, are two binary operators designed specifically for these memories. These </font><font face="verdana" size="2">operators are defined in<sup>15</sup> as follows: First, it is defined the sets <i>A</i> <b>= </b>{0,1} and <i>B</i> <b>= </b>{0,1,2}, then the operators <i>&alpha;: A x A &#8594; B</i> and <i>&beta;</i>: <i>A x B &#8594; A</i> are defined in <a href="#t1">Table 1</a>.</font></p>     <p align="center"><a name="t1"></a><img src="../img/revistas/rmib/v32n2/a4t1.jpg"></p>     <p align="justify"><font face="verdana" size="2">There exist two types of heteroassociative Alpha&#45;Beta memories, these are: type Max (v) and type Min (&#94;). The main difference of this two types is their tolerance to different kinds of alterations. For the generation of both types it will used the operator <img src="../img/revistas/rmib/v32n2/a4i1.jpg">, which is defined as follows:</font></p>  	    <p align="center"><img src="../img/revistas/rmib/v32n2/a4e0.jpg"></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Alpha&#45;Beta heteroassociative memories with correct recall</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Alpha&#45;Beta heteroassociative memories, unlike original and many other models<sup>15,13</sup>, guarantee the correct recall of the fundamental set<sup>16,17</sup>. This section shows the Alpha&#45;Beta heteroassociative memory type min, with which the complete recall of the fundamental set is guaranteed<sup>16</sup>. The Alpha&#45;Beta heteroassociative memory type max is obtained by duality.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Alpha&#45;Beta heteroassociative memory type min</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Let &Lambda; be an Alpha&#45;Beta heteroassociative memory type Min and {(x<i><sup>&micro;</sup>,</i>y<i><sup>&micro;</sup></i>)<i>&#124;&micro; = </i>1,2,...,p} its fundamental set with <i>x<sup>&micro;</sup> </i>&#8712;<i> A<sup>n</sup></i> and y<i><sup>&micro;</sup></i> &#8712; <i>A<sup>p</sup>, A = </i>{0,1}, <i>B = </i>{0,1,2}<i>,n,p </i>&#8712; <i>Z<sup>+</sup>.</i> The number of the components with value equal to zero of the <i>i</i>&#45;th row of &Lambda; is given by:<i> <img src="../img/revistas/rmib/v32n2/a4ea.jpg"></i> where <i>T </i>&#8712; <i>B<sup>n</sup></i> and its components are defined as:</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e1.jpg"></p>     <p align="justify"><font face="verdana" size="2">and the <i>r<sub>i</sub></i> components conform the min sum vector with <b>r</b> &#8712; <i>Z<sup>P16</sup></i>.</font></p>      <p align="justify"><font face="verdana" size="2"><b>a. Learning phase</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Let<b> <i>x</i></b> &#8712; <i>A<sup>n</sup></i> and <b><i>y</i></b> &#8712; <i>A<sup>p</sup></i> be input and output vectors, respectively. The corresponding fundamental set is denoted by {(<i>x<sup>&micro;</sup>,y<sup>&micro;</sup></i>)<i>&#124;&micro; = </i>1,2,...,p}<i>.</i> Which is built according with the following conditions: the y vectors are built with the <i>zero&#45;hot</i> codification: assigning for the output binary pattern <b>y</b> the following values: <i>y<sub>k</sub></i>= 0, and <i>y<sub>j</sub> = </i>1 for </font><font face="verdana" size="2"><i>j = </i>1,2,...,<i>k</i>&#45;1, k + 1,...,p where <i>k</i>&#8712;{1,2,.<i>..,p</i>}. </font><font face="verdana" size="2">And, to each y" vector correspond <i>one and only one</i> <i>x<sup>&micro;</sup></i> vector.</font></p>     <p align="justify"><font face="verdana" size="2">For each <i>&micro;</i> &#8712; {1,2,...p}<i>,</i> from the pair (<i><b>x</b><sup>&micro;</sup>,<b>y</b><sup>&micro;</sup></i>)<i> </i>build the matrix: <img src="../img/revistas/rmib/v32n2/a4e1a.jpg"> then the min binary operator (&#94;) is applied to the resulting matrices. Therefore, the A matrix is obtained as follow:</font></p>      <p align="justify"><img src="../img/revistas/rmib/v32n2/a4e1b.jpg"> <font face="verdana" size="2">where the component in the <i>i</i>&#45;th row and <i>j</i>&#45;th column is given by:</font></p>     ]]></body>
<body><![CDATA[<p align="center"><i><font size="2" face="verdana"><img src="../img/revistas/rmib/v32n2/a4e1c.jpg"></font></i></p>     <p align="justify"><font face="verdana" size="2"><b>b. Recalling phase</b></font></p>  	    <p align="justify"><font face="verdana" size="2">A pattern <i>x<sup>&omega;</sup></i> is presented to &#8711;<sub><i>&beta;</i></sub> the operation is done and the resulting vector is assigned to a vector called <i>z<sup>&omega;</sup></i><b>: </b>z<i><sup>&omega;</sup></i> = &Lambda;&#8711;<sub><i>&beta;</i></sub>,x<i><sup>&omega;</sup></i>. The <i>i&#45;</i>th component of the resulting column vector are:</font> <img src="../img/revistas/rmib/v32n2/a4e1d.jpg"></p>      <p align="justify"><font face="verdana" size="2">It is necessary to build the <i>min sum vector</i> <b>r,</b> therefore the corresponding <i>y<sup>&omega;</sup></i> is given as:</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e2.jpg"></p>     <p align="center">&nbsp;</p>     <p align="justify"><font face="verdana" size="2"><b>Robust retrieval associative memory</b></font></p>  	    <p align="justify"><font face="verdana" size="2">In this section, we describe a model, merging Alpha&#45;Beta associative memories<sup>16</sup> and Needleman&#45;Wunsch algorithm<sup>18</sup>. With this it is possible to handle input patterns of different sizes for both learning and recall phase, keeping the main property: correct recall of the fundamental set.</font></p>  	    <p align="justify"><font face="verdana" size="2">First at all is important to define this: Let <i>X<sup>&alpha;</sup></i> &#8712; <i>A<sup>n</sup></i> with <i>A</i> = {0,1}, <i>n</i> &#8712; <i>Z<sup>+</sup>,</i> and <i>&alpha;</i> &#8712; {1,2,...,p} be a row vector. Let <i>q</i> &#8712; <i>Z<sup>+</sup></i> be the dimension of the new smaller vectors extracted from <i>X<sup>&alpha;</sup>.</i> The <i>vectorial partition operation &rho;{x<sup>&alpha;</sup>,q)</i> is defined as the set of binary row vectors <i>q</i>&#45;dimensionalandisdenoted as follow:</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e3.jpg"></p>      ]]></body>
<body><![CDATA[<p align="justify">&nbsp;</p>     <p align="justify"><font face="verdana" size="2"><b>Robust Alpha&#45;Beta heteroassociative memory type min</b></font></p>  	    <p align="justify"><font face="verdana" size="2"><b>1. Learning phase</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Let <b>x</b> &#8712; <i>A<sup>n</sup></i> and <b>y</b><i> </i>&#8712; <i>A<sup>p</sup></i> with <i>n, p</i> &#8712; Z<sup>+</sup>, be an input and output vectors, respectively. The corresponding fundamental set is denoted by {(<i>x<sup>&micro;</sup>,y<sup>&micro;</sup></i>)<i>&#124;&micro; = </i>1,2,...,p}<i> </i>such that &#8707;<i>&delta;, &sigma; </i>&#8712; {<i>1,  2,...., p</i>} where x<sup>&delta;</sup>&#8712; <i>A<sup>b</sup>,</i> x<sup>&sigma;</sup> &#8712; <i>A<sup>c</sup></i> with <i>b, c</i> &#8712; <i>Z<sup>+</sup></i> and <i>b&#8800;c.</i> Moreover, the <b>y</b> vectors are built with the <i>zero&#45;hot</i> codification: assigning for y<sup>&micro;</sup> the following values: <i>y<sup>&micro;</sup><sub>k</sub> =</i> 0, and <i>y<sup>&micro;</sup><sub>j</sub> =1</i> for <i>j = 1, 2,..., k&#45;1, k + 1,...,p</i> where <i>k</i> &#8712; {1,2,...,p}. And to each <i><b>y</b></i><sup><i>&micro;</i></sup> vector correspond <i>one and only one</i> <b>x</b><sup><i>&micro;</i></sup> vector.</font></p>      <p align="justify"><font face="verdana" size="2">For each <i>&micro;</i> &#8712; {<i>1,  2,...., p</i>}<i>,</i> from the couple <i>(x<sup>&micro;</sup>,y<sup>&micro;</sup>)</i> build the matrix:<i> <img src="../img/revistas/rmib/v32n2/a4e1a.jpg"></i> then, the min binary operator is applied to the matrices. Therefore, the <b>A</b> matrix is obtained as follow:</font></p>     <p align="justify"><img src="../img/revistas/rmib/v32n2/a4e3a.jpg"> <font face="verdana" size="2">where the component in the<i> ij</i>&#45;th component is given by <img src="../img/revistas/rmib/v32n2/a4e3b.jpg"></font></p>     <p align="justify"><font face="verdana" size="2"><b>2. Recall phase</b></font></p>  	    <p align="justify"><font face="verdana" size="2">A fundamental pattern <i>x<sup>&omega;</sup>,</i> that can or not be of different size from other fundamental patterns, is presented to &Lambda;, then the vector <b>y</b> &#8712; <i>A<sup>P</sup></i> is built as follows:</font></p>      <p align="justify"><font face="verdana" size="2">First, the vectorial partition operation is applied to each <i>&lambda;<sub>i</sub></i> &#8712; <i>B<sup>&alpha;</sup></i> and to <i><b>x</b><sup>&omega;</sup></i> &#8712; <i>A<sup>b</sup></i> where <i>&alpha;</i> and <i>&beta; </i>belong to the fundamental set.</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e4.jpg"></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">then the <b>F</b><sup>1</sup> is the matrix built as follow: </font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e7.jpg"></p>     <p align="justify"><font face="verdana" size="2">With <i>F<sub>c,0 </sub>= c*d,F<sub>0,h </sub>= h*d</i> with <i>d</i> &#8712; <i>Z<sup>&#45;</sup></i> being a penalization known as gap, and <b>&eta;</b> being a factor to increase the result of Alpha&#45;Beta memory recognition.</font></p>      <p align="justify"><font face="verdana" size="2">Once the <i>z</i> vector has been built, the <i>sum min vector</i> <i>r</i> &#8712; <i>Z<sup>p</sup></i> is computed. It contain in its <i>i&#151;</i> th component the amount of zeros of the <i>i&#45;th</i> row of the &Lambda; matrix.</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9.jpg"></p>     <p align="justify"><font face="verdana" size="2">where <i>T </i>&#8712; <i>B<sup>n</sup></i> and its components are defined as:</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9a.jpg"></p>      <p align="justify"><font face="verdana" size="2">&#8704;<i>j</i> &#8712; {1,2,...,<i>n</i>}. Therefore the corresponding <i>Y<sup>&micro;</sup></i> is given as:</font></p>      <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9b.jpg"></p>     <p align="justify"><font face="verdana" size="2"><i>Example 1: Let</i> <i>x<sup>1</sup>,x<sup>2</sup> ,X<sup>3</sup> ,x<sup>4</sup></i> <i>be the input patterns</i></font></p>      ]]></body>
<body><![CDATA[<p align="center"><img src="../img/revistas/rmib/v32n2/a4e9c.jpg"></p>     <p align="justify"><font face="verdana" size="2">and the corresponding output vectors</font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9d.jpg"></p>      <p align="justify"><font face="verdana" size="2">the output vectors are built with the Zero&#45;Hot codification, and to each output pattern corresponds one and <i>only one</i> input pattern, therefore the fundamental set is expressed as follow:</font></p>  	    <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9e.jpg"></p>      <p align="justify"><font face="verdana" size="2">Once the fundamental set is made, the learning phase of the new algorithm is applied:</font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9f.jpg"></p>     <p align="justify"><font face="verdana" size="2">The binary operator min &#94; is applied to the matrices obtained before to build the matrix &Lambda;:</font></p>  	    <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9g.jpg"></p>     <p align="justify"><font face="verdana" size="2">Once the &Lambda;: matrix is generated, to recall <i>x<sup>&omega;</sup></i> with <i>&omega;</i> &#8712; {1,2,3,<i>...,p</i>} particularly <i>&omega; =</i> 4, <i>x</i><sup>4</sup> is presented to &Lambda;. First, the vectorial partition operator is applied to vector<i> x<sup>&omega;</sup> </i>and <i>&lambda;<sub>i</sub></i> with <i>q = 3:</i></font></p>     ]]></body>
<body><![CDATA[<p align="center"><img src="../img/revistas/rmib/v32n2/a4e9h.jpg"></p>     <p align="justify"><font face="verdana" size="2">Then, for each <i>i </i>&#8712; {1,2,<i>...,p</i>} and <i>d = </i>&#45;1 and <i>&eta;</i> = 3, the <i>F<sup>i</sup></i> matrices are built as follow:</font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9i.jpg"></p>     <p align="justify"><font face="verdana" size="2">Then, the matrix <i>F<sup>1</sup></i> is:</font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9j.jpg"></p>     <p align="justify"><font face="verdana" size="2">The calculus of each component for the matrices <i>F<sup>2</sup>,F<sup>3</sup>,F<sup>4</sup></i> is not explicitly expressed, however the matrices are shown here:</font></p>     <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9k.jpg"></p>     <p align="justify"><font face="verdana" size="2">the resulting vector could be or not an output patternfromthe fundamental set, in other words, it could be an ambiguous pattern. According with the recall phase, the resulting vector is known as <i>z</i><sup>4</sup> then the <i>min sum vector</i> <i><b>r</b></i> must be built:</font></p>  	    <p align="center"><img src="../img/revistas/rmib/v32n2/a4e9l.jpg"></p>     <p align="justify"><font face="verdana" size="2">after that, the output pattern <i>y</i><sup>4</sup><i>:</i></font></p>      ]]></body>
<body><![CDATA[<p align="center"><img src="../img/revistas/rmib/v32n2/a4e9m.jpg"></p>      <p align="justify"><font face="verdana" size="2">due to the minimum value of <i>r<sub>j</sub></i> where <i>z<sub>j</sub></i><sup>4</sup> = 0&#8704;<i>j</i> &#8712;{1,2,3,4} is 4.</font></p>     <p align="justify">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><b>RESULTS</b></font></p>  	    <p align="justify"><font face="verdana" size="2">This section reports an experimental study of the model. The experimentation was made for both aminoacid and DNA sequences. For aminoacid sequences the dataset was created from NCBI repository and for DNA sequences the datasource was obtained from the Machine Learning Repository of the University of California in Irvine<sup>20</sup>. The proposed model requires that<i> &eta; </i>and <i>q</i> are given. A simple implementation of the model is used to test it. In the following test, <i>&eta; = 5</i> and <i>q = </i>1 are used along some small&#45;scale data sources.</font></p>      <p align="justify"><font face="verdana" size="2">As mentioned before the learning data sources of aminoacid sequences was obtained from the NCBI. The organism selected was <i>Variovorax paradoxus</i> S110 chromosome 1. In order to use the proposed model, it is necessary to relate the aminoacid characters into binary sequences, in <a href="../img/revistas/rmib/v32n2/a4t2.jpg" target="_blank">Table 2</a> shows such relation. They were created using the known blosum62 substitution matrix<sup>19</sup>. Actually, there are two mappings, one for coding the learning patterns and other for recalling patterns. The mappings were built by sorting the 24 characters (20 aminoacids and 4 wildcard) and assigning to each one a twenty four dimensional vector, each component correspond to one of the 24 characters of the aminoacids. Then it was assigned a number one in the component where in the blosum62 matrix the character pair has a positive value, On otherwise. Its main objective is to give information to the model about the most probably changes between aminoacids. To experimentally show that this proposal fulfills the correct recall property of the fundamental set, we have built four different fundamental sets of cardinality p. With these fundamental sets the associative memories are built. Then, the same set of learning patterns are presented to the memory. The recall percentages are shown in <a href="#t3">Table 3</a>.</font></p>     <p align="center"><a name="t3"></a><img src="../img/revistas/rmib/v32n2/a4t3.jpg"></p>     <p align="justify"><font face="verdana" size="2">It is also important to know how the algorithm behave when altered patterns are presented to the associative memory. Based on a fundamental set of the file p50.txt, six altered versions of it were built. The alterations of the fundamental set are shown in <a href="#t5">Table 5</a> and the results are shown in <a href="#t4">Table 4</a>.</font></p>     <p align="center"><a name="t4"></a><img src="../img/revistas/rmib/v32n2/a4t4.jpg"></p>     <p align="center"><a name="t5"></a><img src="../img/revistas/rmib/v32n2/a4t5.jpg"></p>      ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><a href="#t5">Table 5</a> shows the alteration made randomly to the original fundamental set.</font></p>  	    <blockquote> 	      <p align="justify"><font face="verdana" size="2">1.&nbsp;Alteration: indicates the percentage of changes in the sequence. The changes could be mutation, insertion, and deletion.</font></p> 	      <p align="justify"><font face="verdana" size="2">2.&nbsp;Mutation: percentage of substitution of an aminoacid by other</font>.</p> 	      <p align="justify"><font face="verdana" size="2">3.&nbsp;Insertion: percentage of insertion of an amino&#45;acid in the sequence.</font></p> 	      <p align="justify"><font face="verdana" size="2">4.&nbsp;Deletion: percentage of deletion of an amino&#45;acid in the sequence.</font></p> </blockquote>      <p align="justify"><font face="verdana" size="2">In the other hand, to test the performance of the model with DNA sequences, the promoters and splice&#45;junction samples were taken from the <i>&laquo;E. coli</i> promoter gene sequences (DNA) with associated imperfect domain theory&raquo; and &laquo;Primate splice&#45;junction gene sequences (DNA) with associated imperfect domain theory&raquo; datasources, consecutively.</font></p>     <p align="justify"><font face="verdana" size="2">The promoter database has 106 instances split intotwoclasses,promotersandnon&#45;promoters,53 instances to each one. The sequences are formed by 57 nucleotides and its binary codification is shown in <a href="#t6">Table 6</a>. According with<sup>21</sup> the One&#45;Hot codification is one of the most benefical.</font></p>     <p align="center"><a name="t6"></a><img src="../img/revistas/rmib/v32n2/a4t6.jpg"></p>      <p align="justify"><font face="verdana" size="2"><a href="#t7">Table 7</a> shows the percentage of recall on DNA datasets altered with some percentage of the alteration defined before. It is clear that, even when the alterations change the original sequence in both composition and dimension, the new model support this kind of modification and preserve its recall capacity.</font></p>     ]]></body>
<body><![CDATA[<p align="center"><a name="t7"></a><img src="../img/revistas/rmib/v32n2/a4t7.jpg"></p>      <p align="justify"><font face="verdana" size="2"><a href="../img/revistas/rmib/v32n2/a4t8.jpg" target="_blank">Table 8</a> shows the alterations made to the DNA sequences from the original datasource.</font></p>     <p align="justify"><font face="verdana" size="2">Finally, it could be interesting to use the experimental datasources with the original model of associative memories. However, by the nature of the original model it is not possible due to the fact that the input and output vectors should be all the same dimension.</font></p>  	    <p align="justify">&nbsp;</p>     <p align="justify"><font face="verdana" size="2"><b>CONCLUSION AND FUTURE WORK</b></font></p>  	    <p align="justify"><font face="verdana" size="2">In this work, a model for retrieval of aminoacid and DNA sequences from a data sources is proposed. The model ensures the correct recall of the fundamental set, this is the complete set of patterns learned. Moreover, unlike previous models of associative memories, it is capable of supporting some degree of the three type of alterations on the patterns: <i>mutation, deletion,</i> and <i>insertion,</i> as shown on <a href="#t4">Table 4</a>. To do so, a relational table for aminoacids character to binary sequences is proposed. It is possible to use this model in any medical task that requires analysis of DNA or aminoacid sequences; no matter if some sequences has alterations. This model is capable of handling patterns of different sizes for learning and recall.</font></p>  	    <p align="justify"><font face="verdana" size="2">As future work it is important to develop an efficient software that implements the given model. It might be helpful to use the heteroassociative memory type Max to compare the advantages and disadvantages against the proposed model. Test a modified version of the model using the Smith&#45;Waterman algorithm for local alignment. Develop experiments with several ranges of evolutionary proximity.</font></p>     <p align="justify">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><b>ACKNOWLEDGEMENTS</b></font></p>  	    <p align="justify"><font face="verdana" size="2">The authors would like to thank the Instituto Polit&eacute;cnico Nacional (Secretar&iacute;a Acad&eacute;mica, COFAA, SIP, and CIC), the CONACyT, the SNI, and the ICyTDF (grants PIUTE10&#45;77 and PICSO10&#45;85) for their economical support to develop this work.</font></p>     ]]></body>
<body><![CDATA[<p align="justify">&nbsp;</p>      <p align="justify"><font face="verdana" size="2"><b>REFERENCES</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">1.&nbsp;Baldi P Brunak S. Bioinformatics: the machine learning approach. MIT Press, (Cambridge) 2001.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507032&pid=S0188-9532201100020000400001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">2.&nbsp;Von Heijne G. Sequence analysis in molecular biology: Treasure trove or trivial pursuit. Academic Press (London), 1987.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507034&pid=S0188-9532201100020000400002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">3.&nbsp;Doolittle RF. Of URFs and ORFs: A primer on how to analyze derived amino acid sequences. University Science Books, Mill Valley (California) 1986.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507036&pid=S0188-9532201100020000400003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">4.&nbsp;Wolfsberg TG, Wetterstrand KA, Guyer MS, Collins FS, Baxevanis AD. &laquo;A user's guide to the human genome&raquo;. Nature Genetics 2002; 32: 1&#45;79.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507038&pid=S0188-9532201100020000400004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">5.&nbsp;Setubal J, Meidanis J. Introduction to computational molecular biology. International Thomson Publishing (Boston, </font><font face="verdana" size="2">MA), 1999.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507040&pid=S0188-9532201100020000400005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">6.&nbsp;Lesk AM. Introduction to Bioinformatics. Oxford University Press, </font><font face="verdana" size="2">2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507042&pid=S0188-9532201100020000400006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">7.&nbsp;Ray SS, Bandyopadhyay S, Mitra P, Pal SK. &laquo;Bioinformatics in neurocomputing framework&raquo;. Circuits, Devices and Systems, IEE Proceedings 2005; 152(5): 556&#45;564.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507044&pid=S0188-9532201100020000400007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">8.&nbsp;Mitra S, Hayashi Y. &laquo;Bioinformatics with soft computing&raquo;. Systems, man, and cybernetics, Part C. Applications and Reviews, IEEE Transactions 2006; 36(5): 616&#45;635.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507046&pid=S0188-9532201100020000400008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">9.&nbsp;Salzberg SL, Searls DB, Kasif S. Computational methods in molecular biology. Elsevier Science 1998.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507048&pid=S0188-9532201100020000400009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">10.&nbsp;Zepeda HM, Perea&#45;Araujo L, Zarate&#45;Segura PB, V&aacute;zquez&#45;P&eacute;rez JA, Miliar&#45;Garc&iacute;a A, Garibay&#45;Orijel C, Dom&iacute;nguez&#45;L&oacute;pez A, Badillo&#45;Corona JA, L&oacute;pez&#45;Orduna E, Garc&iacute;a&#45;Gonz&aacute;lez OP Villasenor&#45;Ruiz I, Ahued&#45;Ortega A, Aguilar&#45;Faisal L, Bravo J, Lara&#45;Padilla E, Garc&iacute;a&#45;Cavazos RJ. Identification of influenza: A pandemic (H1N1) 2009 variants during the first 2009 influenza outbreak in Mexico City. Journal of Clinical Virology: the Official Publication of the Pan American Society for Clinical Virology 2010; 48: 36&#45;39.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507050&pid=S0188-9532201100020000400010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">11.&nbsp;Zepeda&#45;L&oacute;pez HM, Perea&#45;Araujo L, Miliar&#45;Garc&iacute;a AA, Dom&iacute;nguez&#45;L&oacute;pez B, Xoconostle&#45;Cazarez E, Lara&#45;Padilla JA, Ram&iacute;rez&#45;Hern&aacute;ndez E, Sevilla&#45;Reyes ME, Orozco A, Ahued&#45;Ortega I, Villasenor&#45;Ruiz RJ, Garc&iacute;a&#45;Cavazos, Teran LM. Inside the outbreak of the 2009 influenza A (H1N1) virus in Mexico. PloS One 2010; 5: e13256.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507052&pid=S0188-9532201100020000400011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">12.&nbsp;Hassoun MH. Associative neural memories: Theory and implementation. Oxford University Press (New York), 1993.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507054&pid=S0188-9532201100020000400012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">13.&nbsp;Ritter GX, Sussner P, D&iacute;az&#45;de&#45;Le&oacute;n JL. &laquo;Morphological Associative Memories&raquo;. IEEE Transactions on Neural Networks 1998; 9(2): 281&#45;293.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507056&pid=S0188-9532201100020000400013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">14.&nbsp;Hopfield JJ. &laquo;Neural networks and physical systems with emergent collective computational abilities&raquo;. Biophysics </font><font face="verdana" size="2">1982; 79: 2554&#45;2558.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507058&pid=S0188-9532201100020000400014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">15.&nbsp;Y&aacute;&ntilde;ez&#45;M&aacute;rquez C. Associative memories based on order relations and binary operators. PhD Thesis, Center for Computing Research, National Polytechnic Institute, Mexico, D.F., 2002.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507060&pid=S0188-9532201100020000400015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">16.&nbsp;Rom&aacute;n&#45;God&iacute;nez I, Y&aacute;&ntilde;ez&#45;M&aacute;rquez C. &laquo;Complete Recall on Alpha&#45;Beta Heteroassociative Memory&raquo;. Lecture Notes. Computer Science 2007; 4827: 193&#45;202.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507062&pid=S0188-9532201100020000400016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">17.&nbsp;Rom&aacute;n&#45;God&iacute;nez I, L&oacute;pez&#45;Y&aacute;&ntilde;ez I, Y&aacute;&ntilde;ez&#45;M&aacute;rquez C. &laquo;Classifying patterns in bioinformatics databases by using Alpha&#45;Beta associative memories&raquo;. In: Amandeep S, Sidhu&#45;Tharam SD, editors. Biomedical Data and Applications in Studies in Computational Intelligence. Springer 2009; 187&#45;210.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507064&pid=S0188-9532201100020000400017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">18.&nbsp;Needleman SB, Wunsch CD. &laquo;A general method applicable to the search for similarities in the amino acid sequence of two proteins&raquo;. Journal of Molecular Biology 1970; 48(3): </font><font face="verdana" size="2">443&#45;453.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507066&pid=S0188-9532201100020000400018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font>	</p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">19.&nbsp;Henikoff S, Henikoff JG. &laquo;Amino acid substitution matrices from protein blocks&raquo;. Proc Natl Acad Sci U SA 1992; 89(22): </font><font face="verdana" size="2">10915&#45;10919.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507068&pid=S0188-9532201100020000400019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">20.&nbsp;Asuncion A, Newman DJ. UCI Machine Learning Repository, Irvine, CA: University of California, Department of Informa</font><font face="verdana" size="2">tion and Computer Science. Available at: <a href="http://www.ics.uci.edu/~mlearn/MLRepository.html" target="_blank">http://www.ics.uci.edu/~mlearn/MLRepository.html</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507070&pid=S0188-9532201100020000400020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">21. Brunak S, Engelbrecht J, Knudsen S. &laquo;Prediction of human mRNA donor and acceptor sites from the DNA sequence&raquo;. J Mol Biol 1991; 220: 49&#45;65.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=8507071&pid=S0188-9532201100020000400021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <p align="justify">&nbsp;</p> 	    <p align="justify"><font size="2" face="verdana"><b>Nota</b></font></p>         <p align="justify"><font face="verdana" size="2">Este art&iacute;culo tambi&eacute;n puede ser consultado en versi&oacute;n completa en: <a href="http://www.medigraphic.com/ingenieriabiomedica/" target="_blank">http://www.medigraphic.com/ingenieriabiomedica/</a></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Baldi]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Brunak]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Bioinformatics: the machine learning approach]]></source>
<year>2001</year>
<publisher-loc><![CDATA[Cambridge ]]></publisher-loc>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Von Heijne]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sequence analysis in molecular biology: Treasure trove or trivial pursuit]]></source>
<year>1987</year>
<publisher-loc><![CDATA[London ]]></publisher-loc>
<publisher-name><![CDATA[Academic Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Doolittle]]></surname>
<given-names><![CDATA[RF.]]></given-names>
</name>
</person-group>
<source><![CDATA[Of URFs and ORFs: A primer on how to analyze derived amino acid sequences]]></source>
<year>1986</year>
<publisher-loc><![CDATA[Mill Valley^eCalifornia California]]></publisher-loc>
<publisher-name><![CDATA[University Science Books]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wolfsberg]]></surname>
<given-names><![CDATA[TG]]></given-names>
</name>
<name>
<surname><![CDATA[Wetterstrand]]></surname>
<given-names><![CDATA[KA]]></given-names>
</name>
<name>
<surname><![CDATA[Guyer]]></surname>
<given-names><![CDATA[MS]]></given-names>
</name>
<name>
<surname><![CDATA[Collins]]></surname>
<given-names><![CDATA[FS]]></given-names>
</name>
<name>
<surname><![CDATA[Baxevanis]]></surname>
<given-names><![CDATA[AD.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A user's guide to the human genome]]></article-title>
<source><![CDATA[Nature Genetics]]></source>
<year>2002</year>
<volume>32</volume>
<page-range>1-79</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Setubal]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Meidanis]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to computational molecular biology]]></source>
<year>1999</year>
<publisher-loc><![CDATA[Boston^eMA MA]]></publisher-loc>
<publisher-name><![CDATA[International Thomson Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lesk]]></surname>
<given-names><![CDATA[AM.]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to Bioinformatics]]></source>
<year>2008</year>
<publisher-name><![CDATA[Oxford University Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ray]]></surname>
<given-names><![CDATA[SS]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Mitra]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Pal]]></surname>
<given-names><![CDATA[SK]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[«Bioinformatics in neurocomputing framework». Circuits, Devices and Systems]]></article-title>
<source><![CDATA[IEE Proceedings]]></source>
<year>2005</year>
<volume>152</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>556-564</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mitra]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Hayashi]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Bioinformatics with soft computing». Systems, man, and cybernetics, Part C. Applications and Reviews]]></article-title>
<source><![CDATA[IEEE Transactions]]></source>
<year>2006</year>
<volume>36</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>616-635</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salzberg]]></surname>
<given-names><![CDATA[SL]]></given-names>
</name>
<name>
<surname><![CDATA[Searls]]></surname>
<given-names><![CDATA[DB]]></given-names>
</name>
<name>
<surname><![CDATA[Kasif]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational methods in molecular biology]]></source>
<year>1998</year>
<publisher-name><![CDATA[Elsevier Science]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zepeda]]></surname>
<given-names><![CDATA[HM]]></given-names>
</name>
<name>
<surname><![CDATA[Perea-Araujo]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Zarate-Segura]]></surname>
<given-names><![CDATA[PB]]></given-names>
</name>
<name>
<surname><![CDATA[Vázquez-Pérez]]></surname>
<given-names><![CDATA[JA]]></given-names>
</name>
<name>
<surname><![CDATA[Miliar-García]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Garibay-Orijel]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Domínguez-López]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Badillo-Corona]]></surname>
<given-names><![CDATA[JA]]></given-names>
</name>
<name>
<surname><![CDATA[López-Orduna]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<name>
<surname><![CDATA[García-González]]></surname>
<given-names><![CDATA[OP]]></given-names>
</name>
<name>
<surname><![CDATA[Villasenor-Ruiz]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Ahued-Ortega]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Aguilar-Faisal]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Bravo]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Lara-Padilla]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<name>
<surname><![CDATA[García-Cavazos]]></surname>
<given-names><![CDATA[RJ.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Identification of influenza: A pandemic (H1N1) 2009 variants during the first 2009 influenza outbreak in Mexico City]]></article-title>
<source><![CDATA[Journal of Clinical Virology: the Official Publication of the Pan American Society for Clinical Virology]]></source>
<year>2010</year>
<volume>48</volume>
<page-range>36-39</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zepeda-López]]></surname>
<given-names><![CDATA[HM]]></given-names>
</name>
<name>
<surname><![CDATA[Perea-Araujo]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Miliar-García]]></surname>
<given-names><![CDATA[AA]]></given-names>
</name>
<name>
<surname><![CDATA[Domínguez-López]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Xoconostle-Cazarez]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<name>
<surname><![CDATA[Lara-Padilla]]></surname>
<given-names><![CDATA[JA]]></given-names>
</name>
<name>
<surname><![CDATA[Ramírez-Hernández]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<name>
<surname><![CDATA[Sevilla-Reyes]]></surname>
<given-names><![CDATA[ME]]></given-names>
</name>
<name>
<surname><![CDATA[Orozco]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Ahued-Ortega]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Villasenor-Ruiz]]></surname>
<given-names><![CDATA[RJ]]></given-names>
</name>
<name>
<surname><![CDATA[García-Cavazos]]></surname>
</name>
<name>
<surname><![CDATA[Teran]]></surname>
<given-names><![CDATA[LM.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Inside the outbreak of the 2009 influenza A (H1N1) virus in Mexico]]></article-title>
<source><![CDATA[PloS One]]></source>
<year>2010</year>
<volume>5</volume>
<page-range>e13256</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hassoun]]></surname>
<given-names><![CDATA[MH.]]></given-names>
</name>
</person-group>
<source><![CDATA[Associative neural memories: Theory and implementation]]></source>
<year>1993</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Oxford University Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ritter]]></surname>
<given-names><![CDATA[GX]]></given-names>
</name>
<name>
<surname><![CDATA[Sussner]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Díaz-de-León]]></surname>
<given-names><![CDATA[JL.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Morphological Associative Memories]]></article-title>
<source><![CDATA[IEEE Transactions on Neural Networks]]></source>
<year>1998</year>
<volume>9</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>281-293</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hopfield]]></surname>
<given-names><![CDATA[JJ.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Neural networks and physical systems with emergent collective computational abilities]]></article-title>
<source><![CDATA[Biophysics]]></source>
<year>1982</year>
<volume>79</volume>
<page-range>2554-2558</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yáñez-Márquez]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Associative memories based on order relations and binary operators]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Román-Godínez]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Yáñez-Márquez]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Complete Recall on Alpha-Beta Heteroassociative Memory]]></article-title>
<source><![CDATA[Lecture Notes. Computer Science]]></source>
<year>2007</year>
<volume>4827</volume>
<page-range>193-202</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Román-Godínez]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[López-Yáñez]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Yáñez-Márquez]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Classifying patterns in bioinformatics databases by using Alpha-Beta associative memories]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Amandeep]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Sidhu-Tharam]]></surname>
<given-names><![CDATA[SD]]></given-names>
</name>
</person-group>
<source><![CDATA[Biomedical Data and Applications in Studies in Computational Intelligence]]></source>
<year>2009</year>
<page-range>187-210</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Needleman]]></surname>
<given-names><![CDATA[SB]]></given-names>
</name>
<name>
<surname><![CDATA[Wunsch]]></surname>
<given-names><![CDATA[CD.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A general method applicable to the search for similarities in the amino acid sequence of two proteins]]></article-title>
<source><![CDATA[Journal of Molecular Biology]]></source>
<year>1970</year>
<volume>48</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>443-453</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Henikoff]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Henikoff]]></surname>
<given-names><![CDATA[JG.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Amino acid substitution matrices from protein blocks]]></article-title>
<source><![CDATA[Proc Natl Acad Sci]]></source>
<year>1992</year>
<volume>89</volume>
<numero>22</numero>
<issue>22</issue>
<page-range>10915-10919</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Asuncion]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Newman]]></surname>
<given-names><![CDATA[DJ.]]></given-names>
</name>
</person-group>
<source><![CDATA[UCI Machine Learning Repository]]></source>
<year></year>
<publisher-loc><![CDATA[Irvine^eCA CA]]></publisher-loc>
<publisher-name><![CDATA[University of California, Department of Information and Computer Science]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brunak]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Engelbrecht]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Knudsen]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Prediction of human mRNA donor and acceptor sites from the DNA sequence]]></article-title>
<source><![CDATA[J Mol Biol]]></source>
<year>1991</year>
<volume>220</volume>
<page-range>49-65</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
