SciELO - Scientific Electronic Library Online

 
vol.19 número4Recognizing Textual Entailment by Soft Dependency Tree MatchingIdentification of Verbal Phraseological Units in Mexican News Stories índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

SELLAM, Rahma; DEFFAF, Fatima; SADAT, Fatiha  y  BELGUITH, Lamia Hadrich. Improved Statistical Machine Translation by Cross-Lingustic Projection of Named Entities Recognition and Translation. Comp. y Sist. [online]. 2015, vol.19, n.4, pp.701-711. ISSN 2007-9737.  https://doi.org/10.13053/CyS-19-4-2329.

One of the existing difficulties in natural language processing applications is the lack of appropritate tools for the recognition, translation, and/or transliteration of named entities (NEs), specifically for less- resourced languages. In this paper, we propose a new method to automatically label multilingual parallel data for Arabic-French pair of languages with named entity tags and build lexicons of those named entities with their transliteration and/or translation in the target language. For this purpose, we bring in a third well-resourced language, English, that might serve as pivot, in order to build an Arabic-French NE Translation lexicon. Evaluations on the Arabic-French pair of languages using English as pivot in the transitive model showed the effectiveness of the proposed method for mining Arabic- French named entities and their translations. Moreover, the integration of this component in statistical machine translation outperformed the baseline system.

Palabras llave : Named entity; pivot language; machine translation.

        · texto en Inglés     · Inglés ( pdf )