<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462013000200006</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Generation of Bilingual Dictionaries using Structural Properties]]></article-title>
<article-title xml:lang="es"><![CDATA[Generación de diccionarios bilingües usando las propiedades estructurales]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Dubey]]></surname>
<given-names><![CDATA[Ajay]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Varma]]></surname>
<given-names><![CDATA[Vasudeva]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,International Institute of Information Technology Search and Information Extraction Laboratory ]]></institution>
<addr-line><![CDATA[Hyderabad AP]]></addr-line>
<country>India</country>
</aff>
<aff id="A02">
<institution><![CDATA[,International Institute of Information Technology Search and Information Extraction Laboratory ]]></institution>
<addr-line><![CDATA[Hyderabad AP]]></addr-line>
<country>India</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2013</year>
</pub-date>
<volume>17</volume>
<numero>2</numero>
<fpage>161</fpage>
<lpage>168</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462013000200006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462013000200006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462013000200006&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Building bilingual dictionaries from Wikipedia has been extensively studied in the area of computation linguistics. These dictionaries play a crucial role in Natural Language Processing(NLP) applications like Cross-Lingual Information Retrieval, Machine Translation and Named Entity Recognition. To build these dictionaries, most of the existing approaches use information present in Wikipedia titles, info-boxes and categories. Interestingly, not many use the structural properties of a document like sections, subsections, etc. In this work we exploit the structural properties of documents to build a bilingual English-Hindi dictionary. The main intuition behind this approach is that documents in different languages discussing the same topic are likely to have similar structural elements. Though we present our experiments only for Hindi, our approach is language independent and can be easily extended to other languages. The major contribution of our work is that the dictionary contains translation and transliteration of words which include Named Entities to a large extent. We evaluate our dictionary using manually computed precision. We generated a massive list of 72k tokens using our approach with 0.75 precision.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[Compilación de diccionarios bilingües usando Wikipedia ha sido estudiada mucho en la lingüística computacional. Estos diccionarios juegan un papel crítico en tales aplicaciones del procesamiento de lenguaje natural (PLN) como recuperación de información inter-lingüística, traducción automática y reconocimiento de nombres. La mayoría de los enfoques existentes para la construcción de estos diccionarios usa la información presente en títulos de Wikipedia, info-boxes y categorías. Es interesante que pocos investigadores hayan usado las propiedades estructurales de documentos tales como secciones, sub-secciones, etc. Este trabajo utiliza las propiedades estructurales de documentos para construir un diccionario bilingüe inglés-hindi. La intuición principal en la cual se basa este enfoque es el hecho de que la discusión de un cierto tema en idiomas diferentes puede tener los elementes estructurales similares. Los experimentos se realizaron sólo para hindi, pero el enfoque no depende del idioma particular y puede ser extendida fácilmente a otros idiomas. La mayor aportación de este trabajo es la inclusión en el diccionario las palabras que son nombres traducidos y transliterados. El diccionario fue evaluado mediante la precisión calculada manualmente. Se generó una lista muy grande de 72K tokens usando el enfoque propuesto con la precisión de 0.75.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Bilingual dictionary]]></kwd>
<kwd lng="en"><![CDATA[comparable corpora]]></kwd>
<kwd lng="en"><![CDATA[structural elements]]></kwd>
<kwd lng="es"><![CDATA[Diccionario bilingüe]]></kwd>
<kwd lng="es"><![CDATA[corpus comparable]]></kwd>
<kwd lng="es"><![CDATA[elementos estructurales]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="4"><b>Generation of Bilingual Dictionaries using Structural Properties</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="3"><b>Generaci&oacute;n de diccionarios biling&uuml;es usando las propiedades estructurales</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Ajay Dubey</b><sup><b>1</b></sup> <b>and Vasudeva Varma</b><sup><b>2</b></sup></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><sup><i>1</i></sup> <i>Search and Information Extraction Laboratory, International Institute of Information Technology, Hyderabad, AP, 500032, India</i> <a href="mailto:ajay.dubey@research.iiit.ac.in">ajay.dubey@research.iiit.ac.in</a></font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><sup><i>2</i></sup> <i>Search and Information Extraction Laboratory, International Institute of Information Technology, Hyderabad, AP, 500032, India</i> <a href="mailto:vv@iiit.ac.in">vv@iiit.ac.in</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2">Article received on 06/12/2012    <br> 	Accepted on 11/01/2013.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Building bilingual dictionaries from Wikipedia has been extensively studied in the area of computation linguistics. These dictionaries play a crucial role in Natural Language Processing(NLP) applications like Cross&#45;Lingual Information Retrieval, Machine Translation and Named Entity Recognition. To build these dictionaries, most of the existing approaches use information present in Wikipedia titles, info&#45;boxes and categories. Interestingly, not many use the structural properties of a document like sections, subsections, etc. In this work we exploit the structural properties of documents to build a bilingual English&#45;Hindi dictionary. The main intuition behind this approach is that documents in different languages discussing the same topic are likely to have similar structural elements. Though we present our experiments only for Hindi, our approach is language independent and can be easily extended to other languages. The major contribution of our work is that the dictionary contains translation and transliteration of words which include Named Entities to a large extent. We evaluate our dictionary using manually computed precision. We generated a massive list of 72k tokens using our approach with 0.75 precision.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Bilingual dictionary, comparable corpora, structural elements.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">Compilaci&oacute;n de diccionarios biling&uuml;es usando Wikipedia ha sido estudiada mucho en la ling&uuml;&iacute;stica computacional. Estos diccionarios juegan un papel cr&iacute;tico en tales aplicaciones del procesamiento de lenguaje natural (PLN) como recuperaci&oacute;n de informaci&oacute;n inter&#45;ling&uuml;&iacute;stica, traducci&oacute;n autom&aacute;tica y reconocimiento de nombres. La mayor&iacute;a de los enfoques existentes para la construcci&oacute;n de estos diccionarios usa la informaci&oacute;n presente en t&iacute;tulos de Wikipedia, info&#45;boxes y categor&iacute;as. Es interesante que pocos investigadores hayan usado las propiedades estructurales de documentos tales como secciones, sub&#45;secciones, etc. Este trabajo utiliza las propiedades estructurales de documentos para construir un diccionario biling&uuml;e ingl&eacute;s&#45;hindi. La intuici&oacute;n principal en la cual se basa este enfoque es el hecho de que la discusi&oacute;n de un cierto tema en idiomas diferentes puede tener los elementes estructurales similares. Los experimentos se realizaron s&oacute;lo para hindi, pero el enfoque no depende del idioma particular y puede ser extendida f&aacute;cilmente a otros idiomas. La mayor aportaci&oacute;n de este trabajo es la inclusi&oacute;n en el diccionario las palabras que son nombres traducidos y transliterados. El diccionario fue evaluado mediante la precisi&oacute;n calculada manualmente. Se gener&oacute; una lista muy grande de 72K tokens usando el enfoque propuesto con la precisi&oacute;n de 0.75.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Palabras clave:</b> Diccionario biling&uuml;e, corpus comparable, elementos estructurales.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v17n2/v17n2a6.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1. Adafre, S. &amp; de Rijke, M. (2006).</b> Finding similar sentences across multiple languages in wikipedia. In <i>Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics.</i> 62&#45;69.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060827&pid=S1405-5546201300020000600001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>2. Bharadwaj, R. &amp; Varma, V. (2011</b>). Language independent identification of parallel sentences using wikipedia. In <i>Proceedings of the 20th international conference companion on World wide web.</i> ACM, 11&#45;12.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060829&pid=S1405-5546201300020000600002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>3. Breen, J. (2004).</b> Jmdict: a japanese&#45;multilingual dictionary. In <i>Proceedings of the Workshop on Multilingual Linguistic Ressources.</i> Association for Computational Linguistics, 71&#45;79.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060831&pid=S1405-5546201300020000600003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>4. Brown, P., Cocke, J., Pietra, S., Pietra, V., Jelinek, F., Lafferty, J., Mercer, R., &amp; Roossin, P. (1990).</b> A statistical approach to machine translation. <i>Computational linguistics,</i> 16(2), 79&#45;85.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060833&pid=S1405-5546201300020000600004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>5. Erdmann, M., Nakayama, K., Hara, T., &amp; Nishio, S. (2008).</b> An approach for extracting bilingual terminology from wikipedia. In <i>Database Systems for Advanced Applications.</i> Springer, 380&#45;392.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060835&pid=S1405-5546201300020000600005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6. Erdmann, M., Nakayama, K., Hara, T., &amp; Nishio, S. (2009).</b> Improving the extraction of bilingual terminology from wikipedia. <i>ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP),</i> 5(4), 31.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060837&pid=S1405-5546201300020000600006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>7. Fatiha, S. (2011).</b> Extracting the multilingual terminology from a web&#45;based encyclopedia. In <i>Research Challenges in Information Science (RCIS), 2011 Fifth International Conference on.</i> IEEE, 1&#45;5.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060839&pid=S1405-5546201300020000600007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>8. Fung, P. &amp; McKeown, K. (1997).</b> A technical word&#45;and term&#45;translation aid using noisy parallel corpora across language groups. <i>Machine</i> <i>Translation,</i> 12(1), 53&#45;87.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060841&pid=S1405-5546201300020000600008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>9. Kay, M. &amp; R&ouml;scheisen, M. (1993).</b> Text&#45;translation alignment. <i>computational Linguistics,</i> 19(1), 121&#45;142.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060843&pid=S1405-5546201300020000600009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10. Laws, F., Michelbacher, L., Dorow, B., Scheible, C., Heid, U., &amp; Sch&uuml;tze, H. (2010).</b> A linguistically grounded graph model for bilingual lexicon extraction. In <i>Proceedings of the 23rd International Conference on Computational Linguistics: Posters.</i> Association for Computational Linguistics, 614&#45;622.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060845&pid=S1405-5546201300020000600010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>11. Melamed, I. (1997).</b> A word&#45;to&#45;word model of translational equivalence. In <i>Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics.</i> Association for Computational Linguistics, 490&#45;497.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060847&pid=S1405-5546201300020000600011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>12. Mohammadi, M. &amp; GhasemAghaee, N. (2010).</b> Building bilingual parallel corpora based on wikipedia. In <i>Computer Engineering and Applications (ICCEA), 2010 Second International Conference on,</i> volume 2. IEEE, 264&#45;268.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060849&pid=S1405-5546201300020000600012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>13. Nayan, A., Rao, B., Singh, P., Sanyal, S., &amp; Sanyal, R. (2008).</b> Named entity recognition for indian languages. <i>NER for South and South East Asian Languages,</i> 97.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060851&pid=S1405-5546201300020000600013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>14. Och, F. &amp; Ney, H. (2003).</b> A systematic comparison of various statistical alignment models. <i>Computational linguistics,</i> 29(1), 19&#45;51.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060853&pid=S1405-5546201300020000600014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>15. Rahimi, Z. &amp; Shakery, A. (2010).</b> Creating a wikipedia&#45;based persian&#45;english word association dictionary. In <i>Telecommunications (IST), 2010 5th International Symposium on.</i> IEEE, 562&#45;567.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060855&pid=S1405-5546201300020000600015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>16. Rohit Bharadwaj, G., Tandon, N., &amp; Varma, V. (2009).</b> An iterative approach to extract dictionaries from wikipedia for under&#45;resourced languages. <i>Proc. ICON2010.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060857&pid=S1405-5546201300020000600016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>17. Smith, J., Quirk, C., &amp; Toutanova, K. (2010).</b> Extracting parallel sentences from comparable corpora using document level alignment. In <i>Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics.</i> Association for Computational Linguistics, 403&#45;411.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060859&pid=S1405-5546201300020000600017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>18. Soderland, S., Etzioni, O., Weld, D., Skinner, M., Bilmes, J., et al. (2009).</b> Compiling a massive, multilingual dictionary via probabilistic inference. In <i>Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1&#45;Volume 1.</i> Association for Computational Linguistics, 262&#45;270.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060861&pid=S1405-5546201300020000600018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>19. Tyers, F. &amp; Pienaar, J. (2008).</b> Extracting bilingual word pairs from wikipedia. <i>Collaboration: interoperability between people in the creation of language resources for less&#45;resourced languages,</i> 19.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060863&pid=S1405-5546201300020000600019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>20. Zobel, J. &amp; Dart, P. (1996).</b> Phonetic string matching: Lessons from information retrieval. In <i>Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval.</i> ACM, 166&#45;172.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2060865&pid=S1405-5546201300020000600020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Adafre]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[de Rijke]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Finding similar sentences across multiple languages in wikipedia]]></article-title>
<source><![CDATA[Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics]]></source>
<year>2006</year>
<page-range>62-69</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bharadwaj]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Varma]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Language independent identification of parallel sentences using wikipedia]]></article-title>
<source><![CDATA[Proceedings of the 20th international conference companion on World wide web]]></source>
<year>2011</year>
<page-range>11-12</page-range><publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Breen]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Jmdict: a japanese-multilingual dictionary]]></article-title>
<source><![CDATA[Proceedings of the Workshop on Multilingual Linguistic Ressources]]></source>
<year>2004</year>
<page-range>71-79</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Cocke]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Pietra]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Pietra]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Jelinek]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Lafferty]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Mercer]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Roossin]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A statistical approach to machine translation]]></article-title>
<source><![CDATA[Computational linguistics]]></source>
<year>1990</year>
<volume>16</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>79-85</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Erdmann]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakayama]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Hara]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nishio]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An approach for extracting bilingual terminology from wikipedia]]></article-title>
<source><![CDATA[Database Systems for Advanced Applications]]></source>
<year>2008</year>
<page-range>380-392</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Erdmann]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakayama]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Hara]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nishio]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Improving the extraction of bilingual terminology from wikipedia]]></article-title>
<source><![CDATA[ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)]]></source>
<year>2009</year>
<volume>5</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>31</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fatiha]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Extracting the multilingual terminology from a web-based encyclopedia]]></article-title>
<source><![CDATA[Research Challenges in Information Science (RCIS), 2011 Fifth International Conference on]]></source>
<year>2011</year>
<page-range>1-5</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fung]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[McKeown]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A technical word-and term-translation aid using noisy parallel corpora across language groups]]></article-title>
<source><![CDATA[Machine Translation]]></source>
<year>1997</year>
<volume>12</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>53-87</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kay]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Röscheisen]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text-translation alignment]]></article-title>
<source><![CDATA[computational Linguistics]]></source>
<year>1993</year>
<volume>19</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>121-142</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Laws]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Michelbacher]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Dorow]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Scheible]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Heid]]></surname>
<given-names><![CDATA[U.]]></given-names>
</name>
<name>
<surname><![CDATA[Schütze]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A linguistically grounded graph model for bilingual lexicon extraction]]></article-title>
<source><![CDATA[Proceedings of the 23rd International Conference on Computational Linguistics: Posters]]></source>
<year>2010</year>
<page-range>614-622</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Melamed]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A word-to-word model of translational equivalence]]></article-title>
<source><![CDATA[Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics]]></source>
<year>1997</year>
<page-range>490-497</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mohammadi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[GhasemAghaee]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Building bilingual parallel corpora based on wikipedia]]></article-title>
<source><![CDATA[Computer Engineering and Applications (ICCEA), 2010 Second International Conference on]]></source>
<year>2010</year>
<volume>2</volume>
<page-range>264-268</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nayan]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Sanyal]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Sanyal]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Named entity recognition for indian languages]]></article-title>
<source><![CDATA[NER for South and South East Asian Languages]]></source>
<year>2008</year>
<page-range>97</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Och]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Ney]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A systematic comparison of various statistical alignment models]]></article-title>
<source><![CDATA[Computational linguistics]]></source>
<year>2003</year>
<volume>29</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>19-51</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rahimi]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Shakery]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Creating a wikipedia-based persian-english word association dictionary]]></article-title>
<source><![CDATA[In Telecommunications (IST), 2010 5th International Symposium on]]></source>
<year></year>
<page-range>562-567</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rohit Bharadwaj]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Tandon]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Varma]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An iterative approach to extract dictionaries from wikipedia for under-resourced languages]]></article-title>
<source><![CDATA[Proc. ICON2010]]></source>
<year>2009</year>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Quirk]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Toutanova]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Extracting parallel sentences from comparable corpora using document level alignment]]></article-title>
<source><![CDATA[Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics]]></source>
<year>2010</year>
<page-range>403-411</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Soderland]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Etzioni]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Weld]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Skinner]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bilmes]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Compiling a massive, multilingual dictionary via probabilistic inference]]></article-title>
<source><![CDATA[Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1-Volume 1]]></source>
<year>2009</year>
<page-range>262-270</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tyers]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Pienaar]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Extracting bilingual word pairs from wikipedia]]></article-title>
<source><![CDATA[Collaboration: interoperability between people in the creation of language resources for less-resourced languages]]></source>
<year>2008</year>
<page-range>19</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zobel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Dart]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Phonetic string matching: Lessons from information retrieval]]></article-title>
<source><![CDATA[Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval]]></source>
<year>1996</year>
<page-range>166-172</page-range><publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
