<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462012000400002</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Detección de plagio translingüe utilizando el diccionario estadístico de BabelNet]]></article-title>
<article-title xml:lang="en"><![CDATA[Cross-language Plagiarism Detection Using BabelNet's Statistical Dictionary]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Franco-Salvador]]></surname>
<given-names><![CDATA[Marc]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gupta]]></surname>
<given-names><![CDATA[Parth]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[Paolo]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Universitat Politècnica de València Departamento de Sistemas Informáticos y Computación Natural Language Engineering Lab - ELiRF]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>España</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2012</year>
</pub-date>
<volume>16</volume>
<numero>4</numero>
<fpage>383</fpage>
<lpage>390</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462012000400002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462012000400002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462012000400002&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[En los últimos años ha habido importantes avances en el campo de la detección de plagio automática. Uno de ellos es la detección de plagio translingüe, la cual trata de detectar el plagio entre documentos en diferentes idiomas. La mayoría de aproximaciones que existen para esta tarea hacen uso de diccionarios estadísticos para lidiar con las traducciones de las palabras de los documentos. Un diccionario estadístico nos proporciona, para una palabra dada, la lista de traducciones posibles con sus respectivas probabilidades. El objetivo de este trabajo es analizar el rendimiento del diccionario estadístico de la red semántica multilingüe BabelNet para la tarea de detección de plagio translingüe. En la evaluación comparamos sus resultados con los ofrecidos por un diccionario estadístico entrenado con el conocido modelo de alineamiento IBM M1, ambos utilizando el modelo estado del arte CL-ASA como base. Los resultados de los experimentos indican que BabelNet es una buena alternativa como diccionario estadístico.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[In recent years there have been important advances in the field of automatic plagiarism detection. One variant is cross-language plagiarism detection, which tries to detect plagiarism between documents in different languages. Most of the existing approaches to this task make use of statistical dictionaries to deal with the translations of words in the documents. A statistical dictionary provides, for a given word, the list of possible translations with their respective probabilities. The objective of this paper is to analyze the performance of the statistical dictionary of multilingual semantic network - Babelnet for cross-language plagiarism detection. In the evaluation we compare its results with those offered by a statistical dictionary trained by the well-known IBM M1 aligment model, both using state-of-the-art model CL-ASA as a base. The results of the experiments indicate that Babelnet is a good alternative as statistical dictionary.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[Detección de plagio translingüe]]></kwd>
<kwd lng="es"><![CDATA[similitud textual]]></kwd>
<kwd lng="es"><![CDATA[diccionario estadístico]]></kwd>
<kwd lng="es"><![CDATA[BabelNet]]></kwd>
<kwd lng="en"><![CDATA[Cross-language plagiarism detection]]></kwd>
<kwd lng="en"><![CDATA[textual similarity]]></kwd>
<kwd lng="en"><![CDATA[statistical dictionary]]></kwd>
<kwd lng="en"><![CDATA[BabelNet]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="4"><b>Detecci&oacute;n de plagio transling&uuml;e utilizando el diccionario estad&iacute;stico</b> <b>de BabelNet</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="3"><b>Cross&#45;language Plagiarism Detection Using BabelNet's Statistical Dictionary</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Marc Franco&#45;Salvador, Parth Gupta y Paolo Rosso</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><i>Natural Language Engineering Lab &#45; ELiRF, Departamento de Sistemas Inform&aacute;ticos y Computaci&oacute;n, Universitat Polit&egrave;cnica de Val&egrave;ncia, Espa&ntilde;a. Correo:</i> <a href="mailto:mfranco@dsic.upv.es">mfranco@dsic.upv.es</a>, <a href="mailto:pgupta@dsic.upv.es">pgupta@dsic.upv.es</a>, <a href="mailto:prosso@dsic.upv.es">prosso@dsic.upv.es</a></font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2">Article received on 25/10/2012.    <br> 	Accepted on 26/11/2012.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>  	    <p align="justify"><font face="verdana" size="2">En los &uacute;ltimos a&ntilde;os ha habido importantes avances en el campo de la detecci&oacute;n de plagio autom&aacute;tica. Uno de ellos es la detecci&oacute;n de plagio transling&uuml;e, la cual trata de detectar el plagio entre documentos en diferentes idiomas. La mayor&iacute;a de aproximaciones que existen para esta tarea hacen uso de diccionarios estad&iacute;sticos para lidiar con las traducciones de las palabras de los documentos. Un diccionario estad&iacute;stico nos proporciona, para una palabra dada, la lista de traducciones posibles con sus respectivas probabilidades. El objetivo de este trabajo es analizar el rendimiento del diccionario estad&iacute;stico de la red sem&aacute;ntica multiling&uuml;e BabelNet para la tarea de detecci&oacute;n de plagio transling&uuml;e. En la evaluaci&oacute;n comparamos sus resultados con los ofrecidos por un diccionario estad&iacute;stico entrenado con el conocido modelo de alineamiento IBM M1, ambos utilizando el modelo estado del arte CL&#45;ASA como base. Los resultados de los experimentos indican que BabelNet es una buena alternativa como diccionario estad&iacute;stico.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Palabras clave:</b> Detecci&oacute;n de plagio transling&uuml;e, similitud textual, diccionario estad&iacute;stico, BabelNet.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">In recent years there have been important advances in the field of automatic plagiarism detection. One variant is cross&#45;language plagiarism detection, which tries to detect plagiarism between documents in different languages. Most of the existing approaches to this task make use of statistical dictionaries to deal with the translations of words in the documents. A statistical dictionary provides, for a given word, the list of possible translations with their respective probabilities. The objective of this paper is to analyze the performance of the statistical dictionary of multilingual semantic network &#45; Babelnet for cross&#45;language plagiarism detection. In the evaluation we compare its results with those offered by a statistical dictionary trained by the well&#45;known IBM M1 aligment model, both using state&#45;of&#45;the&#45;art model CL&#45;ASA as a base. The results of the experiments indicate that Babelnet is a good alternative as statistical dictionary.</font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Cross&#45;language plagiarism detection, textual similarity, statistical dictionary, BabelNet.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v16n4/v16n4a2.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Agradecimientos</b></font></p>  	    <p align="justify"><font face="verdana" size="2">En primer lugar agradecer a la Conseller&iacute;a D'educaci&oacute;, Formaci&oacute; i Ocupaci&oacute; de la Generalitat Valenciana por la financiaci&oacute;n por parte del programa Ger&oacute;nimo Forteza, sin el cual no hubiera sido posible llevar a cabo la investigaci&oacute;n del primer autor que ha llevado a esta publicaci&oacute;n. Este trabajo se ha hecho dentro del &aacute;mbito del <i>VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems</i> y como parte del proyecto de la Comisi&oacute;n Europea WIQ&#45;EI IRSES (no. 269180). Por otro lado agradecer a Roberto Navigilipor haber desarrollado BabelNet y ofrecer su ayuda para familiarizarnos con el API sistema. Finalmente a Alberto Barr&oacute;n Cede&ntilde;o por desarrollar la versi&oacute;n inicial del modelo CL&#45;ASA.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Referencias</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1 . Barr&oacute;n&#45;Cede&ntilde;o, A. (2012).</b> <i>On the mono&#45;and cross&#45;language detection of text re&#45;use and plagiarism.</i> Ph.D. thesis, Universitat Politc&egrave;nica de Val&egrave;ncia.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058042&pid=S1405-5546201200040000200001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>2 . Barr&oacute;n&#45;Cede&ntilde;o, A., Rosso, P., Pinto, D., &amp; Juan, A. (2008).</b> On cross&#45;lingual plagiarism analysis using a statistical model. In <i>proceedings of the ECAI'08 workshop on Uncovering Plagiarism, Authorship and Social Software Misuse,</i> PAN'08.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058044&pid=S1405-5546201200040000200002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>3 . Brown, P., Della Pietra, S., Della Pietra, V., &amp; Mercer, R. (1993).</b> The mathematics of statistical machine translation: Parameter estimation. In <i>Computational Linguistics,</i> volume 19(2). 263&#45;311.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058046&pid=S1405-5546201200040000200003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>4 . Comas, R. &amp; Sureda, J. (2008).</b> Academic cyberplagiarism: tracing the causes to reach solutions. <i>Digithum,</i> 10, 1&#45;6.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058048&pid=S1405-5546201200040000200004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>5 . Dumais, S. T., Letsche, T. A., Littman, M. L., &amp; Landauer, T. K. (1997).</b> Automatic cross&#45;language retrieval using latent semantic indexing. In <i>AAAI&#45;97 Spring Symposium Series: Cross&#45;Language Text and Speech Retrieval.</i> Hull &amp; D. Oard (eds.), 18&#45;24.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058050&pid=S1405-5546201200040000200005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6 . Fellbaum, C. (1998).</b> Wordnet: An electronic lexical database. MIT Press.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058052&pid=S1405-5546201200040000200006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>7 . Mcnamee, P. &amp; Mayfield, J. (2004).</b> Character n&#45;gram tokenization for european language text retrieval. <i>Inf. Retr.,</i> 7(1&#45;2), 73&#45;97. ISSN 1386&#45;4564.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058054&pid=S1405-5546201200040000200007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>8 . Navigli, R. &amp; Ponzetto, S. P. (2010).</b> Babelnet: building a very large multilingual semantic network. In <i>proceedings of the 48th annual meeting of the Association for Computational Linguistics,</i> ACL '10. Stroudsburg, PA, USA, 216&#45;225.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058056&pid=S1405-5546201200040000200008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>9 . Navigli, R. &amp; Ponzetto, S. P. (2012).</b> Multilingual wsd with just a few lines of code: The babelnet api. In <i>50th annual meeting of the Association for Computational Linguistics.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058058&pid=S1405-5546201200040000200009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10 . Och, F. J. &amp; Ney, H. (2003).</b> A systematic comparison of various statistical alignment models. In <i>Computational Linguistics,</i> volume 29(1). 19&#45;51.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058060&pid=S1405-5546201200040000200010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>11 . Pinto, D., Civera, J., Barr&oacute;n&#45;Cede&ntilde;o, A., Juan,</b> <b>A., &amp; Rosso, P. (2009).</b> A statistical approach to crosslingual natural language tasks. <i>journal of algorithms,</i> 64(1), 51&#45;60. doi:10.1016/j.jalgor.2009. 02.005.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058062&pid=S1405-5546201200040000200011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>12 . Potthast, M., Barr&oacute;n&#45;Cede&ntilde;o, A., Stein, B., &amp; Rosso, P. (2011</b> ). Cross&#45;language plagiarism detection. <i>Language Resources and Evaluation, Special Issue on Plagiarism and Authorship Analysis,</i> 45(1).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058064&pid=S1405-5546201200040000200012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>13 . Potthast, M., Barr&oacute;n&#45;Cede&ntilde;o, A., Stein, B., &amp; Rosso, P. (2010).</b> An evaluation framework for plagiarism detection. In <i>proc. of the 23rdint. conf. on Computational Linguistics,</i> COLING&#45;2010. Beijing, China, 997&#45;1005.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058066&pid=S1405-5546201200040000200013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>14 . Potthast, M., Eiselt, A., Barr&oacute;n&#45;Cede&ntilde;o, A., Stein,</b> <b>B . , &amp; Rosso, P. (2011</b> ). Overview of the 3rd international competition on plagiarism detection. In <i>CLEF (Notebook Papers/Labs/Workshop).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058068&pid=S1405-5546201200040000200014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>15 . Pouliquen, B., Steinberger, R., &amp; Ignat, C.</b> <b>(2003).</b> Automatic annotation of multilingual text collections with a conceptual thesaurus. In <i>workshop 'Ontologies and Information Extraction' at the Summer School The Semantic Web and Language Technology&#151;its Potential and Practicalities',</i> EUROLAN'2003. 9&#45;28.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058070&pid=S1405-5546201200040000200015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>16 . Stein, B. &amp; Anderka, M. (2009).</b> Collection&#45;relative representations: A unifying view to retrieval models. In <i>20th international conference on Database and Expert Systems Applications,</i> DEXA'09. A.M. Tjoa &amp; R.R. Wagner (eds.), 383&#45;387.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058072&pid=S1405-5546201200040000200016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>17 . Steinberger, R., Pouliquen, B., &amp; Ignat, C. (2004).</b> Exploiting multilingual nomenclatures and language&#45;independent text features as an interlingua for cross&#45;lingual text analysis applications. In <i>4th Slovenian Language Technology Conference,</i> IS'2004. Information Society.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058074&pid=S1405-5546201200040000200017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>18 . Steinberger, R., Pouliquen, B., Widiger, A., Ignat, C., Erjavec, T., Tufis, D., &amp; Varga, D. (2006).</b> The jrc&#45;acquis: A multilingual aligned parallel corpus with +20 languages. In <i>5th international conference on Language Resources and Evaluation.</i> LREC'2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058076&pid=S1405-5546201200040000200018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>19 . Torrej&oacute;n, D. &amp; Ramos, J. (2011).</b> Crosslingual coremo system (contextual reference monotony). In <i>CLEF (Notebook Papers/Labs/Workshop).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058078&pid=S1405-5546201200040000200019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>20 . Vinokourov, A., Shawe&#45;Taylor, J., &amp; Cristianini, N. (2003).</b> Inferring a semantic representation of text via cross&#45;language correlation analysis. In <i>NIPS&#45;02: Advances in Neural Information Processing Systems.</i> S. Becker, S. Thrun, &amp; K. Obermayer (eds.), 1473&#45;1480.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058080&pid=S1405-5546201200040000200020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>21 . Vossen, P. (2004).</b> Eurowordnet: A multilingual database of autonomous and language&#45;specific wordnets connected via an inter&#45;lingual index. In <i>international journal of Lexicography,</i> volume 17.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2058082&pid=S1405-5546201200040000200021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[ ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[On the mono-and cross-language detection of text re-use and plagiarism]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Juan]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[On cross-lingual plagiarism analysis using a statistical model]]></source>
<year>2008</year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Della Pietra]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Della Pietra]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Mercer]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The mathematics of statistical machine translation: Parameter estimation]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1993</year>
<volume>19</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>263-311</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Comas]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Sureda]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Academic cyberplagiarism: tracing the causes to reach solutions]]></article-title>
<source><![CDATA[Digithum]]></source>
<year>2008</year>
<volume>10</volume>
<page-range>1-6</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dumais]]></surname>
<given-names><![CDATA[S. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Letsche]]></surname>
<given-names><![CDATA[T. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Littman]]></surname>
<given-names><![CDATA[M. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Landauer]]></surname>
<given-names><![CDATA[T. K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Automatic cross-language retrieval using latent semantic indexing]]></source>
<year>1997</year>
<page-range>18-24</page-range><publisher-name><![CDATA[Hull & D. Oard]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fellbaum]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Wordnet: An electronic lexical database]]></source>
<year>1998</year>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mcnamee]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Mayfield]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Character n-gram tokenization for european language text retrieval]]></article-title>
<source><![CDATA[Inf. Retr.]]></source>
<year>2004</year>
<volume>7</volume>
<numero>1</numero><numero>2</numero>
<issue>1</issue><issue>2</issue>
<page-range>73-97</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Navigli]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Ponzetto]]></surname>
<given-names><![CDATA[S. P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Babelnet: building a very large multilingual semantic network]]></source>
<year>2010</year>
<page-range>216-225</page-range><publisher-loc><![CDATA[Stroudsburg^ePA PA]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Navigli]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Ponzetto]]></surname>
<given-names><![CDATA[S. P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Multilingual wsd with just a few lines of code: The babelnet api]]></source>
<year>2012</year>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Och]]></surname>
<given-names><![CDATA[F. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Ney]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A systematic comparison of various statistical alignment models]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>2003</year>
<volume>29</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>19-51</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Civera]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Juan]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A statistical approach to crosslingual natural language tasks]]></article-title>
<source><![CDATA[journal of algorithms]]></source>
<year>2009</year>
<volume>64</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>51-60</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Potthast]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Stein]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Cross-language plagiarism detection]]></article-title>
<source><![CDATA[Language Resources and Evaluation, Special Issue on Plagiarism and Authorship Analysis]]></source>
<year>2011</year>
<volume>45</volume>
<numero>1</numero>
<issue>1</issue>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Potthast]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Stein]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[An evaluation framework for plagiarism detection]]></source>
<year>2010</year>
<page-range>997-1005</page-range><publisher-loc><![CDATA[Beijing ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Potthast]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Eiselt]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Barrón-Cedeño]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Stein]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Overview of the 3rd international competition on plagiarism detection]]></source>
<year>2011</year>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pouliquen]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Steinberger]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Ignat]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Automatic annotation of multilingual text collections with a conceptual thesaurus]]></source>
<year>2003</year>
<page-range>9-28</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Stein]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Anderka]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Collection-relative representations: A unifying view to retrieval models]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Tjoa]]></surname>
<given-names><![CDATA[A.M.]]></given-names>
</name>
<name>
<surname><![CDATA[Wagner]]></surname>
<given-names><![CDATA[R.R.]]></given-names>
</name>
</person-group>
<source><![CDATA[20th international conference on Database and Expert Systems Applications, DEXA'09]]></source>
<year>2009</year>
<page-range>383-387</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Steinberger]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Pouliquen]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Ignat]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Exploiting multilingual nomenclatures and language-independent text features as an interlingua for cross-lingual text analysis applications]]></source>
<year>2004</year>
<publisher-name><![CDATA[Information Society]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Steinberger]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Pouliquen]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Widiger]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ignat]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Erjavec]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Tufis]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Varga]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[The jrc-acquis: A multilingual aligned parallel corpus with +20 languages]]></source>
<year>2006</year>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Torrejón]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Ramos]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Crosslingual coremo system (contextual reference monotony)]]></source>
<year>2011</year>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vinokourov]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Shawe-Taylor]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Cristianini]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Inferring a semantic representation of text via cross-language correlation analysis]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Becker]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Thrun]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Obermayer]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[NIPS-02: Advances in Neural Information Processing Systems]]></source>
<year>2003</year>
<page-range>1473-1480</page-range></nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vossen]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Eurowordnet: A multilingual database of autonomous and language-specific wordnets connected via an inter-lingual index.]]></article-title>
<source><![CDATA[international journal of Lexicography]]></source>
<year>2004</year>
<volume>17</volume>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
