<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462014000100003</article-id>
<article-id pub-id-type="doi">10.13053/CyS-18-1-2014-016</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Agregación de medidas de similitud para la detección de ortólogos: validación con medidas basadas en la teoría de conjuntos aproximados]]></article-title>
<article-title xml:lang="en"><![CDATA[Aggregation of Similarity Measures for Ortholog Detection: Validation with Measures Based on Rough Set Theory]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Millo Sánchez]]></surname>
<given-names><![CDATA[Reinier]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Galpert Cañizares]]></surname>
<given-names><![CDATA[Deborah]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Casa Cardoso]]></surname>
<given-names><![CDATA[Gladys]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Grau Ábalo]]></surname>
<given-names><![CDATA[Ricardo]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Arco García]]></surname>
<given-names><![CDATA[Leticia]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[García Lorenzo]]></surname>
<given-names><![CDATA[María Matilde]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Fernandez Marin]]></surname>
<given-names><![CDATA[Miguel Ángel]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[Universidad Central Marta Abreu de Las Villas]]></institution>
<addr-line><![CDATA[Santa Clara ]]></addr-line>
<country>Cuba</country>
</aff>
<aff id="A02">
<institution><![CDATA[Universidad de las Ciencias Informáticas]]></institution>
<addr-line><![CDATA[La Habana ]]></addr-line>
<country>Cuba</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>03</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>03</month>
<year>2014</year>
</pub-date>
<volume>18</volume>
<numero>1</numero>
<fpage>19</fpage>
<lpage>35</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462014000100003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462014000100003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462014000100003&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[En el presente trabajo se propone un algoritmo para la detección de ortólogos que utiliza la agregación de medidas de similitud para caracterizar la relación entre los pares de genes de dos genomas. Las medidas se basan en la puntuación del alineamiento, la longitud de las secuencias, la pertenencia a regiones conservadas y el perfil físico-químico de las proteínas. La fase de agrupamiento sobre el grafo bipartido de similitudes se realiza con el algoritmo de agrupamiento de Markov (MCL). Se define una política de asignación de ortólogos a partir de los grupos de homología obtenidos del agrupamiento. La clasificación se valida con los genomas de Saccharomyces cerevisiae y de Schizosaccharomyces pombe usando la lista de ortólogos del algoritmo INPARANOID 7.0, con la medida de validación externa ARI. También se aplican medidas de validación empleando la teoría de conjuntos aproximados para medir la calidad con manejo del desbalance de las clases.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[This paper presents a novel algorithm for ortholog detection that aggregates similarity measures characterizing the relationship between gene pairs of two genomes. The measures are based on the alignment score, the sequence lengths, membership in conserved regions, and the physicochemical profile of the proteins. The clustering step over the bipartite similarity graph is performed with the Markov clustering algorithm (MCL). A new ortholog assignment policy is applied to the homology groups obtained from the graph clustering. The classification results are validated on the Saccharomyces cerevisiae and Schizosaccharomyces pombe genomes against the ortholog list of the INPARANOID 7.0 algorithm, using the Adjusted Rand Index (ARI) as an external validation measure. Additional validation measures based on rough set theory are applied to assess classification quality while accounting for class imbalance.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[Medidas de similitud]]></kwd>
<kwd lng="es"><![CDATA[genes ortólogos]]></kwd>
<kwd lng="es"><![CDATA[agrupamiento MCL]]></kwd>
<kwd lng="es"><![CDATA[asignación de ortólogos]]></kwd>
<kwd lng="es"><![CDATA[teoría de conjuntos aproximados]]></kwd>
<kwd lng="es"><![CDATA[desbalance de las clases]]></kwd>
<kwd lng="en"><![CDATA[Similarity measures]]></kwd>
<kwd lng="en"><![CDATA[ortholog genes]]></kwd>
<kwd lng="en"><![CDATA[MCL clustering]]></kwd>
<kwd lng="en"><![CDATA[ortholog assignment]]></kwd>
<kwd lng="en"><![CDATA[rough set theory]]></kwd>
<kwd lng="en"><![CDATA[class imbalance]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Articles</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="4"><b>Agregaci&oacute;n de medidas de similitud para la detecci&oacute;n de ort&oacute;logos: validaci&oacute;n con medidas basadas en la teor&iacute;a de conjuntos aproximados</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="3"><b>Aggregation of Similarity Measures for Ortholog Detection: Validation with Measures Based on Rough Set Theory</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Reinier Millo S&aacute;nchez<sup>1</sup>, Deborah Galpert Ca&ntilde;izares<sup>1</sup>, Gladys Casa Cardoso<sup>1</sup>, Ricardo Grau &Aacute;balo<sup>1</sup>, Leticia Arco Garc&iacute;a<sup>1</sup>, Mar&iacute;a Matilde Garc&iacute;a Lorenzo<sup>1</sup>, and Miguel &Aacute;ngel Fernandez Marin<sup>2</sup></b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><sup>1</sup> <i>Universidad Central "Marta Abreu" de Las Villas, Santa Clara, Cuba</i>. <a href="mailto:rmillo@uclv.cu">rmillo@uclv.cu</a></font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><sup>2</sup> <i>Universidad de las Ciencias Inform&aacute;ticas, La Habana, Cuba</i>.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>  	    <p align="justify"><font face="verdana" size="2">En el presente trabajo se propone un algoritmo para la detecci&oacute;n de ort&oacute;logos que utiliza la agregaci&oacute;n de medidas de similitud para caracterizar la relaci&oacute;n entre los pares de genes de dos genomas. Las medidas se basan en la puntuaci&oacute;n del alineamiento, la longitud de las secuencias, la pertenencia a regiones conservadas y el perfil f&iacute;sico&#45;qu&iacute;mico de las prote&iacute;nas. La fase de agrupamiento sobre el grafo bipartido de similitudes se realiza con el algoritmo de agrupamiento de Markov (MCL). Se define una pol&iacute;tica de asignaci&oacute;n de ort&oacute;logos a partir de los grupos de homolog&iacute;a obtenidos del agrupamiento. La clasificaci&oacute;n se valida con los genomas de <i>Saccharomyces cerevisiae</i> y de <i>Schizosaccharomyces pombe</i> usando la lista de ort&oacute;logos del algoritmo INPARANOID 7.0, con la medida de validaci&oacute;n externa ARI. 
Tambi&eacute;n se aplican medidas de validaci&oacute;n empleando la teor&iacute;a de conjuntos aproximados para medir la calidad con manejo del desbalance de las clases.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Palabras clave:</b> Medidas de similitud, genes ort&oacute;logos, agrupamiento MCL, asignaci&oacute;n de ort&oacute;logos, teor&iacute;a de conjuntos aproximados, desbalance de las clases.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">This paper presents a novel algorithm for ortholog detection that aggregates similarity measures characterizing the relationship between gene pairs of two genomes. The measures are based on the alignment score, the sequence lengths, membership in conserved regions, and the physicochemical profile of the proteins. The clustering step over the bipartite similarity graph is performed with the Markov clustering algorithm (MCL). A new ortholog assignment policy is applied to the homology groups obtained from the graph clustering. The classification results are validated on the <i>Saccharomyces cerevisiae</i> and <i>Schizosaccharomyces pombe</i> genomes against the ortholog list of the INPARANOID 7.0 algorithm, using the Adjusted Rand Index (ARI) as an external validation measure. Additional validation measures based on rough set theory are applied to assess classification quality while accounting for class imbalance.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Similarity measures, ortholog genes, MCL clustering, ortholog assignment, rough set theory, class imbalance.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    ]]></body>
<body><![CDATA[<p align="left"><font face="verdana" size="2"><a href="/pdf/cys/v18n1/v18n1a3.pdf" target="_blank">DOWNLOAD ARTICLE IN PDF FORMAT</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1. Achelis, S. B. (1995).</b> <i>Technical Analysis from A to Z.</i> McGraw&#45;Hill.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064895&pid=S1405-5546201400010000300001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>2. Altschul, S. F., Gish, W., Miller, W., Myers, E. W., &amp; Lipman, D. J. (1990).</b> Basic local alignment search tool. <i>Journal of Molecular Biology,</i> 215, 403&#45;410.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064897&pid=S1405-5546201400010000300002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>3. Arco, L. (2008).</b> <i>Agrupamiento basado en la intermediaci&oacute;n diferencial y su valorizaci&oacute;n utilizando la teor&iacute;a de los conjuntos aproximados.</i> PhD thesis, Universidad Central "Marta Abreu" de Las Villas, Santa Clara.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064899&pid=S1405-5546201400010000300003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>4. Ben&#45;Hur, A., Elisseeff, A., &amp; Guyon, I. (2002).</b> A stability based method for discovering structure in clustered data. In <i>Pacific Symposium on Biocomputing.</i> 6&#45;17.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064901&pid=S1405-5546201400010000300004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>5. Bondy, J. A. &amp; Murty, U. S. R. (1976).</b> <i>Graph Theory with Applications.</i> North&#45;Holland.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064903&pid=S1405-5546201400010000300005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6. Brun, M., Sima, C., Hua, J., Lowey, J., Carroll, B., Suh, E., &amp; Dougherty, E. R. (2007).</b> Model&#45;based evaluation of clustering validation measures. <i>Pattern Recognition,</i> 40, 807&#45;824.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064905&pid=S1405-5546201400010000300006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>7. Carpio&#45;Munoz, C. A. D. &amp; Carbajal, J. C. (2002).</b> Folding pattern recognition in proteins using spectral analysis methods. <i>Genome Informatics,</i> 13, 163&#45;172.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064907&pid=S1405-5546201400010000300007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>8. Chen, X., Zheng, J., Fu, Z., Nan, P., Zhong, Y., Lonardi, S., &amp; Jiang, T. (2005).</b> Assignment of orthologous genes via genome rearrangement. <i>IEEE/ACM Transactions on Computational Biology and Bioinformatics,</i> 2(4), 302&#45;315.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064909&pid=S1405-5546201400010000300008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>9. Darling, A. C., Mau, B., &amp; Blattner, F. R. (2004).</b> Mauve: Multiple alignment of conserved genomic sequence with rearrangements. <i>Genome Research,</i> 14(7), 1394&#45;1403.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064911&pid=S1405-5546201400010000300009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10. Darling, A. E., Mau, B., &amp; Perna, N. T. (2010).</b> progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement. <i>PLOS One,</i> 5(6).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064913&pid=S1405-5546201400010000300010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>11. Deza, E. &amp; Deza, M. (2006).</b> <i>Dictionary of Distances.</i> Elsevier.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064915&pid=S1405-5546201400010000300011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>12. Diestel, R. (2000).</b> <i>Graph Theory.</i> Springer.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064917&pid=S1405-5546201400010000300012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>13. Dongen, S. M. v. (2000).</b> <i>Graph Clustering by Flow Simulation.</i> PhD thesis, Faculty Letteren, University Utrecht, Amsterdam.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064919&pid=S1405-5546201400010000300013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>14. Duch, W. (2000).</b> Similarity&#45;based methods: a general framework for classification, approximation and association. <i>Control and Cybernetics,</i> 29(4), 1&#45;30.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064921&pid=S1405-5546201400010000300014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>15. Fred, A. L. &amp; Jain, A. K. (2003).</b> Robust data clustering. In <i>IEEE Computer Society Conference on Computer Vision and Pattern Recognition,</i> volume 3. 128&#45;136.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064923&pid=S1405-5546201400010000300015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>16. Fu, Z., Chen, X., Vacic, V., Nan, P., Zhong, Y., &amp; Jiang, T. (2007).</b> MSOAR: A high&#45;throughput ortholog assignment system based on genome rearrangement. <i>Journal of Computational Biology,</i> 14, 16.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064925&pid=S1405-5546201400010000300016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>17. Galpert, D. (2012).</b> A local&#45;global gene comparison for ortholog detection in two closely related eukaryotes species. <i>Investigacion de Operaciones,</i> 33(2), 130&#45;140.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064927&pid=S1405-5546201400010000300017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>18. Goodstadt, L. &amp; Ponting, C. P. (2006).</b> Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human. <i>PLOS Computational Biology,</i> 2(9).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064929&pid=S1405-5546201400010000300018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>19. Hagelsieb, G. M. &amp; Latimer, K. (2008).</b> BLAST options for better detection of orthologs as reciprocal best hits. <i>Bioinformatics,</i> 24, 319&#45;324.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064931&pid=S1405-5546201400010000300019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>20. Hubert, L. &amp; Arabie, P. (1985).</b> Comparing partitions. <i>Journal of Classification,</i> 193&#45;218.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064933&pid=S1405-5546201400010000300020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>21. Kamvysselis, M. (2003).</b> <i>Computational comparative genomics genes, regulation, evolution.</i> PhD thesis, Massachusetts Institute of Technology.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064935&pid=S1405-5546201400010000300021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>22. Komorowski, J., Pawlak, Z., &amp; Polkowski, L. (1999).</b> Rough sets: A tutorial. In Rough&#45;Fuzzy Hybridization: A New Trend in Decision Making. Springer&#45;Verlag, Singapore.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064937&pid=S1405-5546201400010000300022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>23. 
Kubat, M. &amp; Matwin, S. (1997).</b> Addressing the curse of imbalanced data sets: One&#45;sided sampling. In <i>14th International Conference on Machine Learning.</i> 179&#45;186.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064939&pid=S1405-5546201400010000300023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>24. Lee, Y., Sultana, R., Pertea, G., &amp; Cho, J. (2002).</b> Cross&#45;referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA). <i>Genome Research,</i> 12(3), 493&#45;502.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064941&pid=S1405-5546201400010000300024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>25. Li, L., Stoeckert, C. J., &amp; Roos, D. S. (2003).</b> OrthoMCL: Identification of ortholog groups for eukaryotic genomes. <i>Genome Research,</i> 13, 2178&#45;2189.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064943&pid=S1405-5546201400010000300025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>26. Liu, Y. &amp; Shriberg, E. (2007).</b> Comparing evaluation metrics for sentence boundary detection. In <i>IEEE International Conference on Acoustics, Speech and Signal Processing.</i> 185&#45;188.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064945&pid=S1405-5546201400010000300026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>27. Metz, C. (1978).</b> Basic principles of ROC analysis. <i>Seminars in Nuclear Medicine,</i> 8(4), 283&#45;298.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064947&pid=S1405-5546201400010000300027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>28. Miyazawa, S. &amp; Jernigan, R. L. (1985).</b> Estimation of effective inter&#45;residue contact energies from protein crystal structures: quasi&#45;chemical approximation. <i>Macromolecules,</i> 18, 534&#45;552.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064949&pid=S1405-5546201400010000300028&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>29. Mount, D. W. (2004).</b> <i>Bioinformatics Sequence and Genome Analysis.</i> Cold Spring Harbor Laboratory Press.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064951&pid=S1405-5546201400010000300029&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>30. 
Needleman, S. B. &amp; Wunsch, C. D. (1970).</b> A general method applicable to the search for similarities in the amino acid sequence of two proteins. <i>Journal of Molecular Biology,</i> 48(3).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064953&pid=S1405-5546201400010000300030&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>31. O'Brien, K. P., Remm, M., &amp; Sonnhammer, E. L. (2005).</b> Inparanoid: a comprehensive database of eukaryotic orthologs. <i>Nucleic Acids Research,</i> 33, D476&#45;D480.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064955&pid=S1405-5546201400010000300031&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>32. Ostlund, G., Schmitt, T., Forslund, K., &amp; Kostler, T. (2010).</b> Inparanoid 7: new algorithm and tools for eukaryotic orthology analysis. <i>Nucleic Acids Research,</i> 38(Database issue), D196&#45;D203.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064957&pid=S1405-5546201400010000300032&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>33. Overbeek, R., Fonstein, M., D'Souza, M., Pusch, G. D., &amp; Maltsev, N. (1999).</b> The use of gene clusters to infer functional coupling. In <i>Proceedings of the National Academy of Sciences of the United States of America,</i> volume 96. 
2896&#45;2901.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064959&pid=S1405-5546201400010000300033&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>34. Pal, A. D., Dovier, A., &amp; Fogolari, F. (2003).</b> Protein folding in CLP(FD) with empirical contact energies. In <i>Joint Annual Workshop of the ERCIM Working Group on Constraints and the CoLogNET area on Constraints and Logic Programming, In Recent Advances in Constraints.</i> Springer&#45;Verlag, Budapest, Hungary, 250&#45;265.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064961&pid=S1405-5546201400010000300034&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>35. Pawlak, Z. (1982).</b> Rough sets. <i>International Journal of Computer and Information Sciences,</i> 11(5), 341&#45;356.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064963&pid=S1405-5546201400010000300035&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>36. Pawlak, Z. (1991).</b> Rough sets: Theoretical aspects of reasoning about data.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064965&pid=S1405-5546201400010000300036&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>37. Pawlak, Z. (1995).</b> Vagueness and uncertainty: a rough set perspective. <i>Computational Intelligence: an International Journal,</i> 11, 227&#45;232.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064967&pid=S1405-5546201400010000300037&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>38. Rand, W. (1971).</b> Objective criteria for the evaluation of clustering methods. <i>American Statistical Association,</i> 66(336), 846&#45;850.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064969&pid=S1405-5546201400010000300038&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>39. Rasmussen, M. &amp; Kellis, M. (2005).</b> Multi&#45;bus: An algorithm for resolving multi&#45;species gene correspondence and gene family relationships. <i>CSAIL Research.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064971&pid=S1405-5546201400010000300039&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>40. Remm, M., Storm, C. E. V., &amp; Sonnhammer, E. L. L. (2001).</b> Automatic clustering of orthologs and in&#45;paralogs from pairwise species comparisons. <i>Journal of Molecular Biology,</i> 314, 1041&#45;1052.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064973&pid=S1405-5546201400010000300040&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>41. Santos, J. M. &amp; Embrechts, M. (2009).</b> On the use of the adjusted Rand index as a metric for evaluating supervised classification. In <i>ICANN'09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part II.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064975&pid=S1405-5546201400010000300041&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>42. Shulcloper, J. R., Guzman&#45;Arenas, A., &amp; Martinez&#45;Trinidad, J. F. (1995).</b> <i>Enfoque l&oacute;gico combinatorio al reconocimiento de patrones: Selecci&oacute;n de variables y clasificaci&oacute;n supervisada.</i> Instituto Polit&eacute;cnico Nacional.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064977&pid=S1405-5546201400010000300042&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>43. Slowinski, R. &amp; Vanderpooten, D. (1997).</b> Similarity relation as a basis for rough approximations. In <b>Wang, P.,</b> editor, <i>Advances in Machine Intelligence &amp; Soft&#45;Computing.</i> 17&#45;33.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064979&pid=S1405-5546201400010000300043&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>44. Smith, T. F. &amp; Waterman, M. S. (1981).</b> Identification of common molecular sequences. <i>Journal of Molecular Biology,</i> 147, 195&#45;197.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064981&pid=S1405-5546201400010000300044&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>45. Tatusov, R. L. (2003).</b> The COG database: an updated version includes eukaryotes. <i>BMC Bioinformatics,</i> 4(41).    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064983&pid=S1405-5546201400010000300045&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>46. Tatusov, R. L., Koonin, E. V., &amp; Lipman, D. J. (1997).</b> A genomic perspective on protein families. <i>Science,</i> 278(5338).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064985&pid=S1405-5546201400010000300046&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>47. Towfic, F., Greenlee, M. H. W., &amp; Honavar, V. (2009).</b> Detection of gene orthology based on protein&#45;protein interaction networks. In <i>IEEE International Conference on Bioinformatics and Biomedicine.</i> Washington DC, USA, 48&#45;53.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064987&pid=S1405-5546201400010000300047&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>48. van Rijsbergen, C. J. (1979).</b> <i>Information retrieval.</i> Butterworths, 2nd edition.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064989&pid=S1405-5546201400010000300048&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>49. Webber, C. A. P. &amp; Chris, P. (2004).</b> Genes and homology. <i>Current Biology,</i> 14(R332).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064991&pid=S1405-5546201400010000300049&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>50. Weiss, G. M. &amp; Provost, F. (2003).</b> Learning when training data are costly: The effect of class distribution on tree induction. <i>Journal of Artificial Intelligence Research,</i> 19, 315&#45;354.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064993&pid=S1405-5546201400010000300050&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>51. Yoon, K. &amp; Kwek, S. (2005).</b> An unsupervised learning approach to resolving the data imbalanced issue in supervised learning problems in functional genomics. In <i>Proceedings of the Fifth International Conference on Hybrid Intelligent Systems.</i> 303&#45;308.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2064995&pid=S1405-5546201400010000300051&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Achelis]]></surname>
<given-names><![CDATA[S. B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Technical Analysis from A to Z]]></source>
<year>1995</year>
<publisher-name><![CDATA[McGraw-Hill]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Altschul]]></surname>
<given-names><![CDATA[S. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Gish]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Miller]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Myers]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Lipman]]></surname>
<given-names><![CDATA[D. J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Basic local alignment search tool]]></article-title>
<source><![CDATA[Journal of Molecular Biology]]></source>
<year>1990</year>
<volume>215</volume>
<page-range>403-410</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Arco]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Agrupamiento basado en la intermediación diferencial y su valorización utilizando la teoría de los conjuntos aproximados]]></source>
<year>2008</year>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ben-Hur]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Elisseeff]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Guyon]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A stability based method for discovering structure in clustered data]]></article-title>
<source><![CDATA[Pacific Symposium on Biocomputing]]></source>
<year>2002</year>
<page-range>6-17</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bondy]]></surname>
<given-names><![CDATA[J. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Murty]]></surname>
<given-names><![CDATA[U. S. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Graph Theory with Applications]]></source>
<year>1976</year>
<publisher-loc><![CDATA[North-Holland ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brun]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Sima]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Hua]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Lowey]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Carroll]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Suh]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Dougherty]]></surname>
<given-names><![CDATA[E. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Model-based evaluation of clustering validation measures]]></article-title>
<source><![CDATA[Pattern Recognition]]></source>
<year>2007</year>
<volume>40</volume>
<page-range>807-824</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carpio-Munoz]]></surname>
<given-names><![CDATA[C. A. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Carbajal]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Folding pattern recognition in proteins using spectral analysis methods]]></article-title>
<source><![CDATA[Genome Informatics]]></source>
<year>2002</year>
<volume>13</volume>
<page-range>163-172</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Zheng]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Fu]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Nan]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhong]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Lonardi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Assignment of orthologous genes via genome rearrangement]]></article-title>
<source><![CDATA[IEEE-ACM transactions on computational biology and bioinformatics]]></source>
<year>2005</year>
<volume>2</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>302-315</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Darling]]></surname>
<given-names><![CDATA[A. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Mau]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Blattner]]></surname>
<given-names><![CDATA[F. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Mauve: Multiple alignment of conserved genomic sequence with rearrangements]]></article-title>
<source><![CDATA[Genome Research]]></source>
<year>2004</year>
<volume>14</volume>
<numero>7</numero>
<issue>7</issue>
<page-range>1394-1403</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Darling]]></surname>
<given-names><![CDATA[A. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Mau]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Perna]]></surname>
<given-names><![CDATA[N. T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement]]></article-title>
<source><![CDATA[PLOS One]]></source>
<year>2010</year>
<volume>5</volume>
<numero>6</numero>
<issue>6</issue>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Deza]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Deza]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Dictionary of Distances]]></source>
<year>2006</year>
<publisher-name><![CDATA[Elsevier]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Diestel]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Graph Theory]]></source>
<year>2000</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dongen]]></surname>
<given-names><![CDATA[S. M. v.]]></given-names>
</name>
</person-group>
<source><![CDATA[Graph Clustering by Flow Simulation]]></source>
<year>2000</year>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Duch]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Similarity-based methods: a general framework for classification, approximation and association]]></article-title>
<source><![CDATA[Control and Cybernetics]]></source>
<year>2000</year>
<volume>29</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>1-30</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fred]]></surname>
<given-names><![CDATA[A. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Jain]]></surname>
<given-names><![CDATA[A. K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Robust data clustering]]></article-title>
<source><![CDATA[IEEE Computer Society Conference on Computer Vision and Pattern Recognition]]></source>
<year>2003</year>
<volume>3</volume>
<page-range>128-136</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fu]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Vacic]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Nan]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhong]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[MSOAR: A high-throughput ortholog assignment system based on genome rearrangement]]></article-title>
<source><![CDATA[Journal of Computational Biology]]></source>
<year>2007</year>
<volume>14</volume>
<page-range>16</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Galpert]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A local-global gene comparison for ortholog detection in two closely related eukaryotes species]]></article-title>
<source><![CDATA[Investigacion de Operaciones]]></source>
<year>2012</year>
<volume>33</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>130-140</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Goodstadt]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Ponting]]></surname>
<given-names><![CDATA[C. P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human]]></article-title>
<source><![CDATA[PLOS Computational Biology]]></source>
<year>2006</year>
<volume>2</volume>
<numero>9</numero>
<issue>9</issue>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hagelsieb]]></surname>
<given-names><![CDATA[G. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Latimer]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[BLAST options for better detection of orthologs as reciprocal best hits]]></article-title>
<source><![CDATA[Bioinformatics]]></source>
<year>2008</year>
<volume>24</volume>
<page-range>319-324</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hubert]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Arabie]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Comparing partitions]]></article-title>
<source><![CDATA[Journal of Classification]]></source>
<year>1985</year>
<page-range>193-218</page-range></nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kamvysselis]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational comparative genomics: genes, regulation, evolution]]></source>
<year>2003</year>
</nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Komorowski]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Pawlak]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Polkowski]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Rough sets: a tutorial, in rough-fuzzy hybridization: A new trend in decision making]]></source>
<year>1999</year>
<publisher-name><![CDATA[Springer-Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kubat]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Matwin]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Addressing the curse of imbalanced data sets: One-sided sampling]]></article-title>
<source><![CDATA[14th International Conference on Machine Learning]]></source>
<year>1997</year>
<page-range>179-186</page-range></nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Sultana]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Pertea]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Cho]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Cross-referencing eukaryotic genomes: TIGR orthologous gene alignments (TOGA)]]></article-title>
<source><![CDATA[Genome Research]]></source>
<year>2002</year>
<volume>12</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>493-502</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Stoeckert]]></surname>
<given-names><![CDATA[C. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Roos]]></surname>
<given-names><![CDATA[D. S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[OrthoMCL: Identification of ortholog groups for eukaryotic genomes]]></article-title>
<source><![CDATA[Genome Research]]></source>
<year>2003</year>
<volume>13</volume>
<page-range>2178-2189</page-range></nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Shriberg]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Comparing evaluation metrics for sentence boundary detection]]></article-title>
<source><![CDATA[IEEE International Conference on Acoustics, Speech and Signal Processing]]></source>
<year>2007</year>
<page-range>185-188</page-range></nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Metz]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Basic principles of ROC analysis]]></article-title>
<source><![CDATA[Seminars in Nuclear Medicine]]></source>
<year>1978</year>
<volume>8</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>283-298</page-range></nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Miyazawa]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jernigan]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Estimation of effective inter-residue contact energies from protein crystal structures: quasi-chemical approximation]]></article-title>
<source><![CDATA[Macromolecules]]></source>
<year>1985</year>
<volume>18</volume>
<page-range>534-552</page-range></nlm-citation>
</ref>
<ref id="B29">
<label>29</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mount]]></surname>
<given-names><![CDATA[D. W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Bioinformatics: Sequence and Genome Analysis]]></source>
<year>2004</year>
<publisher-name><![CDATA[Cold Spring Harbor Laboratory Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B30">
<label>30</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Needleman]]></surname>
<given-names><![CDATA[S. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Wunsch]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A general method applicable to the search for similarities in the amino acid sequence of two proteins]]></article-title>
<source><![CDATA[Journal of Molecular Biology]]></source>
<year>1970</year>
<volume>48</volume>
<numero>3</numero>
<issue>3</issue>
</nlm-citation>
</ref>
<ref id="B31">
<label>31</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[O'Brien]]></surname>
<given-names><![CDATA[K. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Remm]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Sonnhammer]]></surname>
<given-names><![CDATA[E. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Inparanoid: a comprehensive database of eukaryotic orthologs]]></article-title>
<source><![CDATA[Nucleic Acids Research]]></source>
<year>2005</year>
<volume>33</volume>
<page-range>D476-D480</page-range></nlm-citation>
</ref>
<ref id="B32">
<label>32</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ostlund]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Schmitt]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Forslund]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Kostler]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Inparanoid 7: new algorithm and tools for eukaryotic orthology analysis]]></article-title>
<source><![CDATA[Nucleic Acids Research]]></source>
<year>2010</year>
<volume>38</volume>
<page-range>D196-D203</page-range></nlm-citation>
</ref>
<ref id="B33">
<label>33</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Overbeek]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Fonstein]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[D'Souza]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Pusch]]></surname>
<given-names><![CDATA[G. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Maltsev]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The use of gene clusters to infer functional coupling]]></article-title>
<source><![CDATA[Proceedings of the National Academy of Sciences of the United States of America]]></source>
<year>1999</year>
<volume>96</volume>
<page-range>2896-2901</page-range>
</nlm-citation>
</ref>
<ref id="B34">
<label>34</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pal]]></surname>
<given-names><![CDATA[A. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Dovier]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Fogolari]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Protein folding in clp(fd) with empirical contact energies]]></article-title>
<source><![CDATA[Joint Annual Workshop of the ERCIM Working Group on Constraints and the CoLogNET area on Constraints and Logic Programming, In Recent Advances in Constraints]]></source>
<year>2003</year>
<page-range>250-265</page-range><publisher-loc><![CDATA[Budapest ]]></publisher-loc>
<publisher-name><![CDATA[Springer-Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B35">
<label>35</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pawlak]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Rough sets]]></article-title>
<source><![CDATA[International Journal of Computer and Information Sciences]]></source>
<year>1982</year>
<volume>11</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>341-356</page-range></nlm-citation>
</ref>
<ref id="B36">
<label>36</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pawlak]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<source><![CDATA[Rough sets: Theoretical aspects of reasoning about data]]></source>
<year>1991</year>
</nlm-citation>
</ref>
<ref id="B37">
<label>37</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pawlak]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Vagueness and uncertainty: a rough set perspective]]></article-title>
<source><![CDATA[Computational Intelligence: an International Journal]]></source>
<year>1995</year>
<volume>11</volume>
<page-range>227-232</page-range></nlm-citation>
</ref>
<ref id="B38">
<label>38</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rand]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Objective criteria for the evaluation of clustering methods]]></article-title>
<source><![CDATA[Journal of the American Statistical Association]]></source>
<year>1971</year>
<volume>66</volume>
<numero>336</numero>
<issue>336</issue>
<page-range>846-850</page-range></nlm-citation>
</ref>
<ref id="B39">
<label>39</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rasmussen]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Kellis]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Multi-bus: An algorithm for resolving multi-species gene correspondence and gene family relationships]]></source>
<year>2005</year>
<publisher-name><![CDATA[CSAIL Research]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B40">
<label>40</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Remm]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Storm]]></surname>
<given-names><![CDATA[C. E. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Sonnhammer]]></surname>
<given-names><![CDATA[E. L. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic clustering of orthologs and in-paralogs from pairwise species comparisons]]></article-title>
<source><![CDATA[Journal of Molecular Biology]]></source>
<year>2001</year>
<volume>314</volume>
<page-range>1041-1052</page-range></nlm-citation>
</ref>
<ref id="B41">
<label>41</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Santos]]></surname>
<given-names><![CDATA[J. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Embrechts]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[On the use of the adjusted Rand index as a metric for evaluating supervised classification]]></article-title>
<source><![CDATA[ICANN'09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part II]]></source>
<year>2009</year>
</nlm-citation>
</ref>
<ref id="B42">
<label>42</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shulcloper]]></surname>
<given-names><![CDATA[J. R.]]></given-names>
</name>
<name>
<surname><![CDATA[Guzman-Arenas]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Martinez-Trinidad]]></surname>
<given-names><![CDATA[J. F.]]></given-names>
</name>
</person-group>
<source><![CDATA[Enfoque lógico combinatorio al reconocimiento de patrones: Selección de variables y clasificación supervisada]]></source>
<year>1995</year>
<publisher-name><![CDATA[Instituto Politécnico Nacional]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B43">
<label>43</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Slowinski]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Vanderpooten]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Similarity relation as a basis for rough approximations]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Advances in Machine Intelligence & Soft-Computing]]></source>
<year>1997</year>
<page-range>17-33</page-range></nlm-citation>
</ref>
<ref id="B44">
<label>44</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[T. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Waterman]]></surname>
<given-names><![CDATA[M. S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Identification of common molecular sequences]]></article-title>
<source><![CDATA[Journal of Molecular Biology]]></source>
<year>1981</year>
<volume>147</volume>
<page-range>195-197</page-range></nlm-citation>
</ref>
<ref id="B45">
<label>45</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tatusov]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The COG database: an updated version includes eukaryotes]]></article-title>
<source><![CDATA[BMC Bioinformatics]]></source>
<year>2003</year>
<volume>4</volume>
<numero>41</numero>
<issue>41</issue>
</nlm-citation>
</ref>
<ref id="B46">
<label>46</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tatusov]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Koonin]]></surname>
<given-names><![CDATA[E. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Lipman]]></surname>
<given-names><![CDATA[D. J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A genomic perspective on protein families]]></article-title>
<source><![CDATA[Science]]></source>
<year>1997</year>
<volume>278</volume>
<numero>5338</numero>
<issue>5338</issue>
</nlm-citation>
</ref>
<ref id="B47">
<label>47</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Towfic]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Greenlee]]></surname>
<given-names><![CDATA[M. H. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Honavar]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Detection of gene orthology based on protein-protein interaction networks]]></article-title>
<source><![CDATA[IEEE International Conference on Bioinformatics and Biomedicine]]></source>
<year>2009</year>
<page-range>48-53</page-range><publisher-loc><![CDATA[Washington, DC]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B48">
<label>48</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[van Rijsbergen]]></surname>
<given-names><![CDATA[C. J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Information retrieval]]></source>
<year>1979</year>
<edition>2</edition>
<publisher-name><![CDATA[Butterworths]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B49">
<label>49</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Webber]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Ponting]]></surname>
<given-names><![CDATA[C. P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Genes and homology]]></article-title>
<source><![CDATA[Current Biology]]></source>
<year>2004</year>
<volume>14</volume>
<numero>R332</numero>
<issue>R332</issue>
</nlm-citation>
</ref>
<ref id="B50">
<label>50</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Weiss]]></surname>
<given-names><![CDATA[G. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Provost]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Learning when training data are costly: The effect of class distribution on tree induction]]></article-title>
<source><![CDATA[Journal of Artificial Intelligence Research]]></source>
<year>2003</year>
<volume>19</volume>
<page-range>315-354</page-range></nlm-citation>
</ref>
<ref id="B51">
<label>51</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yoon]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Kwek]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An unsupervised learning approach to resolving the data imbalanced issue in supervised learning problems in functional genomics]]></article-title>
<source><![CDATA[Proceedings of the Fifth International Conference on Hybrid Intelligent Systems]]></source>
<year>2005</year>
<page-range>303-308</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
