<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442010000100003</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Semi-Automatic Parallel Corpora Extraction from Comparable News Corpora]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[Thoudam Doren]]></given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[Sivaji]]></given-names>
</name>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Jadavpur University Computer Science and Engineering Department ]]></institution>
<addr-line><![CDATA[ Kolkata]]></addr-line>
<country>India</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2010</year>
</pub-date>
<numero>41</numero>
<fpage>11</fpage>
<lpage>17</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442010000100003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442010000100003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442010000100003&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[The parallel corpus is a necessary resource in many multi/cross lingual natural language processing applications that include Machine Translation and Cross Lingual Information Retreival. Preparation of large scale parallel corpus takes time and also demands the linguistics skill. In the present work, a technique has been developed that extracts parallel corpus between Manipuri, a morphologically rich and resource constrained Indian language and English from a comparable news corpora collected from the web. A medium sized Manipuri-English bilingual lexicon and another list of Manipuri-English transliterated entities have been developed and used in the present work. Using morphological information for the agglutinative and inflective Manipuri language, the alignment quality based on similarity measure is further improved. A high level of performance is desirable since errors in sentence alignment cause further errors in systems that use the aligned text. The system has been evaluated and error analysis has also been carried out. The technique shows its effectiveness in Manipuri-English language pair and is extendable to other resource constrained, agglutinative and inflective Indian languages.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Parallel corpora]]></kwd>
<kwd lng="en"><![CDATA[similarity measure]]></kwd>
<kwd lng="en"><![CDATA[bilingual lexicon]]></kwd>
<kwd lng="en"><![CDATA[morphology]]></kwd>
<kwd lng="en"><![CDATA[named entity list]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Special section: processing of semantic information</font></p>     <p align="justify"><font face="verdana" size="4">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>Semi&#150;Automatic Parallel Corpora Extraction from Comparable News Corpora</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>Thoudam Doren Singh and Sivaji Bandyopadhyay</b></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i>Computer Science and Engineering Department, Jadavpur University, Kolkata, India.</i> (<a href="mailto:thoudam.doren@gmail.com">thoudam.doren@gmail.com</a>, <a href="mailto:sivaji_cse_ju@yahoo.com">sivaji_cse_ju@yahoo.com</a>).</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2">Manuscript received February 15, 2010.     <br> Manuscript accepted for publication May 31, 2010.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract </b></font></p>     <p align="justify"><font face="verdana" size="2">The parallel corpus is a necessary resource in many multi/cross lingual natural language processing applications that include Machine Translation and Cross Lingual Information Retreival. Preparation of large scale parallel corpus takes time and also demands the linguistics skill. In the present work, a technique has been developed that extracts parallel corpus between Manipuri, a morphologically rich and resource constrained Indian language and English from a comparable news corpora collected from the web. A medium sized Manipuri&#150;English bilingual lexicon and another list of Manipuri&#150;English transliterated entities have been developed and used in the present work. Using morphological information for the agglutinative and inflective Manipuri language, the alignment quality based on similarity measure is further improved. A high level of performance is desirable since errors in sentence alignment cause further errors in systems that use the aligned text. The system has been evaluated and error analysis has also been carried out. The technique shows its effectiveness in Manipuri&#150;English language pair and is extendable to other resource constrained, agglutinative and inflective Indian languages.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Key words: </b>Parallel corpora, similarity measure, bilingual lexicon, morphology, named entity list.arallel corpora, similarity measure, bilingual lexicon, morphology, named entity list.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n41/n41a3.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>REFERENCES</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; W. A. Gale and K. W. Church, "A program for aligning sentences in bilingual corpora," in <i>Proceedings of 29th Annual Meeting ofthe Association for Computational Linguistics, </i>Berkeley, California, 1991, pp. 177&#150;184.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043343&pid=S1870-9044201000010000300001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; P. Koehn, "A parallel corpus for statistical machine translation," in <i>In </i><i>MT Summit X, </i>2005.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043345&pid=S1870-9044201000010000300002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; M. Kay and M. Roscheisen, "Text translation alignment," in <i>Computational Linguistics, </i>1993, pp. 121&#150;142.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043347&pid=S1870-9044201000010000300003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; P. F. Brown, J. C. Lai, and R. L. Mercer, "Aligning sentences in parallel corpora," in <i>Proceedings of 29th Annual Meeting ofthe Association for </i><i>Computational Linguistics, </i>Berkeley, California, 1991, pp. 169&#150;176.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043349&pid=S1870-9044201000010000300004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; P. F. Brown, S. A. D. Pietra, V. J. D. Pietra, and R. L. Mercer, "Mathematics of statistical machine translation: Parameter estimation," in <i>Computational Linguistics, </i>1993, pp. 163&#150;311.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043351&pid=S1870-9044201000010000300005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; M. Simard and P. Plamondon, "Bilingual sentence alignment: Balancing robustness and accuracy," in <i>Machine Translation, 13(1), </i>1998, pp. 59&#150;80.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043353&pid=S1870-9044201000010000300006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; K. W. Church, "Char align: A program for aligning parallel texts at the character level," in <i>Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, </i>Columbus, Ohio, 1993, pp. 1&#150;8.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043355&pid=S1870-9044201000010000300007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; S. F. Chen, "Aligning sentences in bilingual corpora using lexical information," in <i>Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, </i>Columbus, Ohio, 1993, pp. 9&#150;16.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043357&pid=S1870-9044201000010000300008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; N. Collier, H. Hirakawa, and A. Kumano, "Machine translation vs. dictionary term translation &#150; a comparison for english&#150;japanese news article alignment," in <i>In COLING&#150;ACL 98, </i>1998, pp. 263&#150;267.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043359&pid=S1870-9044201000010000300009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; K. Matsumoto and H. Tanaka, "Automatic alignment of japanese and english newspaper articles using an mt system and a bilingual company name dictionary," in <i>In LREC&#150;2002, </i>2002, pp. 480&#150;484.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043361&pid=S1870-9044201000010000300010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; M. Utiyama and H. Isahara, "Reliable measures for aligning japanese&#150;english news articles and sentences," in <i>Proceedings of the 41st Annual Meeting on Association for Computational Linguistics &#150; Volume 1, </i>Sapporo, Japan, 2003, pp. 72&#150;79.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043363&pid=S1870-9044201000010000300011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; A. K. Singh and S. Husain, "Comparison, selection and use of sentence alignment algorithms for new language pairs," in <i>Proceedings ofthe ACL&#150;05: Association for Computational Linguistics Workshop, </i>Ann Arbor, USA, 2005, pp. 177&#150;184.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043365&pid=S1870-9044201000010000300012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; T. Utsuro, H. Ikeda, M. Yamane, Y. Matsumoto, and M. Nagao, "Bilingual text matching using bilingual dictionary and statistics," in <i>In COLING' 94, </i>1994, pp. 1076&#150;1082.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043367&pid=S1870-9044201000010000300013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; S. I. Singh, "Manipuri to english dictionary." Imphal, India: S. Ibetombi Devi, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043369&pid=S1870-9044201000010000300014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; T. D. Singh, N. Kishorjit, A. Ekbal, and S. Bandyopadhyay, "Named entity recognition for manipuri using support vector machine," in <i>In Proceedings of PACLIC 23, </i>Hong Kong, 2009, pp. 811&#150;818.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043371&pid=S1870-9044201000010000300015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; A. Ekbal, S. K. Naskar, and S. Bandyopadhyay, "A modified joint source&#150;channel model for transliteration," in <i>Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions. </i>Sydney: Association for Computational Linguistics, 2006, pp. 191&#150;198.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043373&pid=S1870-9044201000010000300016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; T. D. Singh and S. Bandyopadhyay, "Manipuri morphological analyzer," in <i>In the Proceedings of the Platinum Jubilee International Conference </i><i>of LSI, </i>Hyderabad, India, 2005.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043375&pid=S1870-9044201000010000300017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; &#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;, "Word class and sentence type identification in manipuri morphological analyzer," in <i>In Proceedings of MSPIL, </i>Mumbai, India, 2006, pp. 11&#150;17.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043377&pid=S1870-9044201000010000300018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; &#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;, "Morphology driven manipuri pos tagger," in <i>In Proceedings </i><i>of IJCNLP&#150;08 Workshop on NLP for Less Privileged Languages, </i>Hyderabad, India, 2008, pp. 91&#150;98.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043379&pid=S1870-9044201000010000300019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; &#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;, "Manipuri&#150;english example based machine translation system," in <i>International Journal of Computational Linguistics and Applications (IJCLA), ISSN 0976&#150;0962. </i>Delhi, India: Bahri Publication, 2010, pp. 147&#150;158.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043381&pid=S1870-9044201000010000300020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;21&#93; P. Langlais, M. Simard, and J. Veronis, "Methods and practical issues in evaluating alignment techniques," in <i>Proceedings of16th International Conference on Computational Linguistics (COLING&#150;96), </i>1996.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6043383&pid=S1870-9044201000010000300021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[ ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gale]]></surname>
<given-names><![CDATA[W. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Church]]></surname>
<given-names><![CDATA[K. W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A program for aligning sentences in bilingual corpora]]></article-title>
<source><![CDATA[Proceedings of 29th Annual Meeting ofthe Association for Computational Linguistics]]></source>
<year>1991</year>
<page-range>177-184</page-range><publisher-loc><![CDATA[Berkeley^eCalifornia California]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koehn]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A parallel corpus for statistical machine translation]]></article-title>
<source><![CDATA[MT Summit X]]></source>
<year>2005</year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kay]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Roscheisen]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text translation alignment]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1993</year>
<page-range>121-142</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[P. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Lai]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Mercer]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Aligning sentences in parallel corpora]]></article-title>
<source><![CDATA[Proceedings of 29th Annual Meeting ofthe Association for Computational Linguistics]]></source>
<year>1991</year>
<page-range>169-176</page-range><publisher-loc><![CDATA[Berkeley^eCalifornia California]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[P. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Pietra]]></surname>
<given-names><![CDATA[S. A. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Pietra]]></surname>
<given-names><![CDATA[V. J. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Mercer]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Mathematics of statistical machine translation: Parameter estimation]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1993</year>
<page-range>163-311</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Simard]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Plamondon]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Bilingual sentence alignment: Balancing robustness and accuracy]]></article-title>
<source><![CDATA[Machine Translation]]></source>
<year>1998</year>
<volume>13</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>59-80</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Church]]></surname>
<given-names><![CDATA[K. W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Char align: A program for aligning parallel texts at the character level]]></article-title>
<source><![CDATA[Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics]]></source>
<year>1993</year>
<page-range>1-8</page-range><publisher-loc><![CDATA[Columbus ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[S. F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Aligning sentences in bilingual corpora using lexical information]]></article-title>
<source><![CDATA[Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics]]></source>
<year>1993</year>
<page-range>9-16</page-range><publisher-loc><![CDATA[Columbus ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Collier]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Hirakawa]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Kumano]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Machine translation vs. dictionary term translation - a comparison for english-japanese news article alignment]]></article-title>
<source><![CDATA[COLING-ACL 98]]></source>
<year>1998</year>
<page-range>263-267</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Tanaka]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic alignment of japanese and english newspaper articles using an mt system and a bilingual company name dictionary]]></article-title>
<source><![CDATA[LREC-2002]]></source>
<year>2002</year>
<page-range>480-484</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Utiyama]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Isahara]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Reliable measures for aligning japanese-english news articles and sentences]]></article-title>
<source><![CDATA[Proceedings of the 41st Annual Meeting on Association for Computational Linguistics]]></source>
<year>2003</year>
<volume>1</volume>
<page-range>72-79</page-range><publisher-loc><![CDATA[Sapporo ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[A. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Husain]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Comparison, selection and use of sentence alignment algorithms for new language pairs]]></article-title>
<source><![CDATA[Proceedings of the ACL-05: Association for Computational Linguistics Workshop]]></source>
<year>2005</year>
<page-range>177-184</page-range><publisher-loc><![CDATA[Ann Arbor ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Utsuro]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Ikeda]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamane]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Nagao]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Bilingual text matching using bilingual dictionary and statistics]]></article-title>
<source><![CDATA[COLING' 94]]></source>
<year>1994</year>
<page-range>1076-1082</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[S. I.]]></given-names>
</name>
</person-group>
<source><![CDATA[Manipuri to english dictionary]]></source>
<year>2004</year>
<publisher-loc><![CDATA[Imphal ]]></publisher-loc>
<publisher-name><![CDATA[S. Ibetombi Devi]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kishorjit]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Named entity recognition for manipuri using support vector machine]]></article-title>
<source><![CDATA[Proceedings of PACLIC 23]]></source>
<year>2009</year>
<page-range>811-818</page-range><publisher-loc><![CDATA[Hong Kong ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Naskar]]></surname>
<given-names><![CDATA[S. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A modified joint source-channel model for transliteration]]></article-title>
<source><![CDATA[Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions]]></source>
<year>2006</year>
<page-range>191-198</page-range><publisher-loc><![CDATA[Sydney ]]></publisher-loc>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Manipuri morphological analyzer]]></article-title>
<source><![CDATA[the Proceedings of the Platinum Jubilee International Conference of LSI]]></source>
<year>2005</year>
<publisher-loc><![CDATA[Hyderabad ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Word class and sentence type identification in manipuri morphological analyzer]]></article-title>
<source><![CDATA[Proceedings of MSPIL]]></source>
<year>2006</year>
<page-range>11-17</page-range><publisher-loc><![CDATA[Mumbai ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Morphology driven manipuri pos tagger]]></article-title>
<source><![CDATA[Proceedings of IJCNLP-08 Workshop on NLP for Less Privileged Languages]]></source>
<year>2008</year>
<page-range>91-98</page-range><publisher-loc><![CDATA[Hyderabad ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Manipuri-english example based machine translation system]]></article-title>
<source><![CDATA[International Journal of Computational Linguistics and Applications]]></source>
<year>2010</year>
<page-range>147-158</page-range><publisher-loc><![CDATA[Delhi ]]></publisher-loc>
<publisher-name><![CDATA[Bahri Publication]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Langlais]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Simard]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Veronis]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Methods and practical issues in evaluating alignment techniques]]></article-title>
<source><![CDATA[Proceedings of 16th International Conference on Computational Linguistics (COLING-96)]]></source>
<year></year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
