Services on Demand
Journal
Article
Indicators
- Cited by SciELO
- Access statistics
Related links
- Similars in SciELO
Share
Polibits
On-line version ISSN 1870-9044
Polibits n.39 México Jan./Jun. 2009
Articles
Application of Pronominal Divergence and Anaphora Resolution in EnglishHindi Machine Translation
Kamlesh Dutta1, Nupur Prakash2, and Saroj Kaushik3
1 Computer Science & Engineering Department, National Institute of Technology, Hamirpur177005 (HP), India (phone: +9119723044424; fax: +911972223834, email: kdnith@gmail.com).
2 School of Information Technology, Guru Gobind Singh Inderprastha University, Delhi. Currently she is on deputation as additional director, ICAI, India (email: nupurprakash@rediffmail.com).
3 Computer Science & Engineering Department, Indian Institute of Technology. Delhi, India (email: saroj@cse.iitd.ac.in).
Manuscript received March 23, 2008.
Manuscript accepted for publication March 04, 2009.
Abstract
So far the majority of Machine Translation (MT) research has focused on translation at the level of individual sentences. For sentence level translation, Machine Translation has addressed various divergence issues for large variety of languages; the issue of pronominal divergence has been presented only recently. Since the quality of translation as required by users follows coherent multisentence discourse structure in a specific context, the pronominal divergence helps us in understanding the nuances of translation arising out of disparity in the languages. Subsequently using clues from this divergence, the anaphora resolution system can find the correct interpretation for the given pronominal referents and other entities by resolving the intersentential context. In the literature, researchers have examined the issue and have proposed ways for their classification and resolution of anaphora. However for Indic languages, not many studies are available. In this paper, we discuss different aspects of pronominal divergence that affects the anaphora resolution in English Hindi Machine Translation (EHMT). The study shall be helpful in developing approaches that can explicitly use intersentential information in order to resolve specific types of ambiguity and which can generate coherent multisentence discourse structure in the target language to produce higher quality of translation Machine Translation.
Key words: Pronominal, anaphora, machine translation, divergence.
DESCARGAR ARTÍCULO EN FORMATO PDF
REFERENCES
[1] R. Mitkov, Anaphora Resolution, Pearson Education. Longman, London. 2002. [ Links ]
[2] R. Mitkov, S. K. Choi and R. Sharp, "Anaphora Resolution in Machine Translation," in Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation TMI95, pp. 8795, Leuven, Belgium, 1995. [ Links ]
[3] A. F. Gelbukh and G. Sidorov, "On Indirect Anaphora Resolution," in Proc. PACLING99, Pacific Association for Computational Linguistics, pp. 181190, Waterloo, Ontario, Canada, August 2528, 1999. [ Links ]
[4] D. Gupta and N. Chaterjee, "Identification of Divergence for English to Hindi EBMT," in Proceeding of MT Summit IX, pp. 141148, 2003. [ Links ]
[5] R. Evans, "Applying Machine Learning Toward an Automatic Classification of It," Literary and Linguistic Computing, Vol. 16. No. 1, Oxford University Press, pp. 4557, 2001. [ Links ]
[6] http://www.cse.iitk.ac.in [ Links ]
[7] http://202.141.152.9/matra/index.jsp [ Links ]
[8] http://translate.google.com/ [ Links ]
[9] B.J. Dorr, "Machine Translation Divergences: A Formal Description and Proposed Solution," Computational Linguistics, Vol. 20, Number 4, pp. 597633, 1994. [ Links ]
[10] B. J. Dorr, L. Pearl, R. Hwa and N. Habash, "DUSTer: A Method for Unraveling CrossLanguage Divergences for Statistical WordLevel Alignment," Machine Translation: From Research to Real Users, LNCS 2499, pp. 3143, 2003. [ Links ]