<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462015000400701</article-id>
<article-id pub-id-type="doi">10.13053/CyS-19-4-2329</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Improved Statistical Machine Translation by Cross-Linguistic Projection of Named Entities Recognition and Translation]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sellami]]></surname>
<given-names><![CDATA[Rahma]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Deffaf]]></surname>
<given-names><![CDATA[Fatima]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sadat]]></surname>
<given-names><![CDATA[Fatiha]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Belguith]]></surname>
<given-names><![CDATA[Lamia Hadrich]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Sfax University, ANLP Research Group]]></institution>
<addr-line><![CDATA[Sfax ]]></addr-line>
<country>Tunisia</country>
</aff>
<aff id="Af2">
<institution><![CDATA[UQAM]]></institution>
<addr-line><![CDATA[Montreal ]]></addr-line>
<country>Canada</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2015</year>
</pub-date>
<volume>19</volume>
<numero>4</numero>
<fpage>701</fpage>
<lpage>711</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462015000400701&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462015000400701&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462015000400701&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: One of the existing difficulties in natural language processing applications is the lack of appropriate tools for the recognition, translation, and/or transliteration of named entities (NEs), specifically for less-resourced languages. In this paper, we propose a new method to automatically label multilingual parallel data for the Arabic-French language pair with named entity tags and to build lexicons of those named entities with their transliteration and/or translation in the target language. For this purpose, we bring in a third, well-resourced language, English, to serve as a pivot in building an Arabic-French NE translation lexicon. Evaluations on the Arabic-French language pair using English as the pivot in the transitive model showed the effectiveness of the proposed method for mining Arabic-French named entities and their translations. Moreover, integrating this component into a statistical machine translation system outperformed the baseline system.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Named entity]]></kwd>
<kwd lng="en"><![CDATA[pivot language]]></kwd>
<kwd lng="en"><![CDATA[machine translation]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Al-Onaizan]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Knight]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Translating named entities using monolingual and bilingual resources]]></source>
<year>2002</year>
<conf-name><![CDATA[40th Annual Meeting of the Association for Computational Linguistics]]></conf-name>
<conf-loc>Philadelphia, Pennsylvania, USA </conf-loc>
<page-range>400-8</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Azab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bouamor]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Mohit]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Oflazer]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Dudley North visits North London: Learning when to transliterate to Arabic]]></source>
<year>2013</year>
<conf-name><![CDATA[ HLT/NAACL]]></conf-name>
<conf-date>2013</conf-date>
<conf-loc>Atlanta, USA </conf-loc>
<page-range>439-44</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Buckwalter]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Buckwalter Arabic Morphological Analyzer Version 1.0]]></source>
<year>2002</year>
<publisher-name><![CDATA[Linguistic Data Consortium, University of Pennsylvania]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[H.-H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning formulation and transformation rules for multilingual named entities]]></source>
<year>2003</year>
<conf-name><![CDATA[ ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>1-8</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Darwish]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Transliteration mining with phonetic conflation and iterative training]]></source>
<year>2010</year>
<conf-name><![CDATA[ 2010 Named Entities Workshop, NEWS '10]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>53-6</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Feng]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Lü]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhou]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[A new approach for English-Chinese named entity alignment]]></source>
<year>2004</year>
<conf-name><![CDATA[ Conference on Empirical Methods in Natural Language Processing]]></conf-name>
<conf-date>2004</conf-date>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Finkel]]></surname>
<given-names><![CDATA[J. R.]]></given-names>
</name>
<name>
<surname><![CDATA[Grenager]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Incorporating non-local information into information extraction systems by Gibbs sampling]]></source>
<year>2005</year>
<conf-name><![CDATA[43rd Annual Meeting on Association for Computational Linguistics, ACL '05]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>363-70</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Traitement automatique des entités nommées en arabe : détection et traduction]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gahbiche-Braham]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Bonneau-Maynard]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yvon]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<source><![CDATA[Traitement Automatique des Langues]]></source>
<year>2014</year>
<volume>5</volume>
<numero>2</numero>
<issue>2</issue>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gupta]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Majumder]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[External plagiarism detection: N-gram approach using named entity recognizer - lab report for PAN at CLEF 2010]]></source>
<year>2010</year>
<publisher-name><![CDATA[CLEF]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Habash]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Four techniques for online handling of out-of-vocabulary words in Arabic-English statistical machine translation]]></source>
<year>2008</year>
<conf-name><![CDATA[46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, HLT-Short '08]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>57-60</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Huang]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Vogel]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Waibel]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Automatic extraction of named entity translingual equivalence based on multi-feature cost minimization]]></source>
<year>2003</year>
<conf-name><![CDATA[ ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>9-16</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koehn]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Hoang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Birch]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Callison-Burch]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Federico]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bertoldi]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Cowan]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Shen]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Moran]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Zens]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Dyer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Bojar]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Constantin]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Herbst]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Moses: Open source toolkit for statistical machine translation]]></source>
<year>2007</year>
<conf-name><![CDATA[45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ACL '07]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>177-80</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kumano]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Kashioka]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Tanaka]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Fukusima]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Acquiring bilingual named entity translations from content aligned corpora]]></source>
<year>2004</year>
<page-range>177-86</page-range><publisher-name><![CDATA[IJCNLP]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Alignment of bilingual named entities in parallel corpora using statistical models and multiple knowledge sources]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[C.-J.]]></given-names>
</name>
<name>
<surname><![CDATA[Chang]]></surname>
<given-names><![CDATA[J. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jang]]></surname>
<given-names><![CDATA[J.-S. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM Transactions on Asian Language Information Processing]]></source>
<year>2006</year>
<volume>5</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>121-45</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Binary codes capable of correcting deletions, insertions, and reversals]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Levenshtein]]></surname>
<given-names><![CDATA[V. I.]]></given-names>
</name>
</person-group>
<source><![CDATA[Soviet Physics Doklady]]></source>
<year>1966</year>
<volume>10</volume>
<page-range>707-10</page-range></nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Birnbaum]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[What do they think?: Aggregating local views about news events and topics]]></source>
<year>2008</year>
<conf-name><![CDATA[17th International Conference on World Wide Web, WWW '08]]></conf-name>
<conf-loc>New York, NY, USA </conf-loc>
<page-range>1021-2</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moore]]></surname>
<given-names><![CDATA[R. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning translations of named-entity phrases from parallel corpora]]></source>
<year>2003</year>
<conf-name><![CDATA[Tenth Conference on European Chapter of the Association for Computational Linguistics]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>259-66</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Papineni]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Roukos]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Ward]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[W.-J.]]></given-names>
</name>
</person-group>
<source><![CDATA[BLEU: A method for automatic evaluation of machine translation]]></source>
<year>2002</year>
<conf-name><![CDATA[40th Annual Meeting of the Association for Computational Linguistics, ACL '02]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>311-8</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Samy]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Moreno]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Guirao]]></surname>
<given-names><![CDATA[J. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[A proposal for an Arabic named entity tagger leveraging a parallel corpus (Spanish-Arabic)]]></source>
<year>2005</year>
<conf-name><![CDATA[ Recent Advances in Natural Language]]></conf-name>
<conf-loc> </conf-loc>
<page-range>459-65</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sellami]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Sadat]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Belguith Hadrich]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mining named entity translation from non parallel corpora]]></source>
<year>2014</year>
<page-range>219-24</page-range><publisher-name><![CDATA[FLAIRS]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[JRC-Names: A freely available, highly multilingual named entity resource]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Steinberger]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Pouliquen]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Kabadjov]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
<name>
<surname><![CDATA[van der Goot]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[CoRR]]></source>
<year>2013</year>
<volume>abs/1309.6162</volume>
</nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[RENAR: A rule-based Arabic named entity recognition system]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zaghouani]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM Trans. Asian Lang. Inf. Process]]></source>
<year>2012</year>
<volume>11</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>2-13</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zaghouani]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Critical survey of the freely available Arabic corpora]]></source>
<year>2014</year>
<conf-name><![CDATA[ Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-8</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zitouni]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Florian]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mention detection crossing the language barrier]]></source>
<year>2008</year>
<conf-name><![CDATA[ Conference on Empirical Methods in Natural Language]]></conf-name>
<conf-loc> </conf-loc>
<page-range>600-9</page-range></nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zobel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Dart]]></surname>
<given-names><![CDATA[P. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Frei]]></surname>
<given-names><![CDATA[H.-P.]]></given-names>
</name>
<name>
<surname><![CDATA[Harman]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Schauble]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Wilkinson]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Phonetic string matching: Lessons from information retrieval]]></source>
<year>1996</year>
<conf-name><![CDATA[19th Conference on Research and Development in Information Retrieval]]></conf-name>
<conf-loc> </conf-loc>
<page-range>166-72</page-range><publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
