<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462021000400803</article-id>
<article-id pub-id-type="doi">10.13053/cys-25-4-4044</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Part-of-Speech Tagging for Mizo Language Using Conditional Random Field]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Nunsanga]]></surname>
<given-names><![CDATA[Morrel V. L.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pakray]]></surname>
<given-names><![CDATA[Partha]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Lallawmsanga]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[L. Lolit Kumar]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Mizoram University Department of Information Technology ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>India</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,National Institute of Technology Silchar Department of Computer Science and Engineering ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>India</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Mizoram University Department of Electronics and Communication Engineering ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>India</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2021</year>
</pub-date>
<volume>25</volume>
<numero>4</numero>
<fpage>803</fpage>
<lpage>812</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462021000400803&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462021000400803&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462021000400803&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: Part of speech (POS) tagging assigns a class or tag to each token in a sentence. The tag allocated to a word is mainly its part of speech or any other class of interest. Several applications of Natural Language Processing (NLP) require it as a prerequisite. The development of part-of-speech tagging for the under-resourced Mizo language is presented in this study, which makes use of a stochastic model known as Conditional Random Field (CRF). The CRF is a discriminative probabilistic classifier that considers both the context of a given word and the tag transition probabilities in the training dataset. A corpus of approximately 30,000 words was collected and manually annotated with the proposed tagset for system evaluation. On various sizes of training and test sets, the tagger achieved 89.46 % accuracy, 89.3 % F1-score, 89.42 % precision, and 89.48 % recall.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Mizo POS tagging]]></kwd>
<kwd lng="en"><![CDATA[conditional random field]]></kwd>
<kwd lng="en"><![CDATA[Mizo part of speech tagger]]></kwd>
<kwd lng="en"><![CDATA[computational linguistics]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Awasthi]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Ravindran]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part of speech tagging and chunking with HMM and CFR]]></source>
<year>2006</year>
<conf-name><![CDATA[ NLP Association of India (NLPAI) Machine Learning Contest]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Deshmukh]]></surname>
<given-names><![CDATA[R.D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kiwelekar]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Deep learning techniques for part of speech tagging by natural language processing]]></source>
<year>2020</year>
<conf-name><![CDATA[ 2nd International Conference on Innovative Mechanisms for Industry Applications IEEE. (ICIMIA)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>76-81</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[T.D.]]></given-names>
</name>
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Manipuri POS tagging using CRF and SVM: A language independent approach]]></source>
<year>2008</year>
<conf-name><![CDATA[ 6th International Conference on Natural Language Processing (ICON-2008)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>240-5</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pandian]]></surname>
<given-names><![CDATA[S.L.]]></given-names>
</name>
<name>
<surname><![CDATA[Geetha]]></surname>
<given-names><![CDATA[T.V.]]></given-names>
</name>
</person-group>
<source><![CDATA[CRF models for Tamil part of speech tagging and chunking]]></source>
<year>2009</year>
<conf-name><![CDATA[ International Conference on Computer Processing of Oriental Languages]]></conf-name>
<conf-loc> </conf-loc>
<page-range>11-22</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Outahajala]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Benajiba]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Zenkouar]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Pos tagging in Amazighe using support vector machines and conditional random fields]]></source>
<year>2011</year>
<conf-name><![CDATA[ International Conference on Application of Natural Language to Information Systems]]></conf-name>
<conf-loc> </conf-loc>
<page-range>238-324</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nongmeikapam]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandyopadhyay]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A transliteration of CRF based Manipuri POS tagging]]></article-title>
<source><![CDATA[Procedia Technology]]></source>
<year>2012</year>
<volume>6</volume>
<page-range>582-9</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shambhavi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Kumar]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Kannada part-of-speech tagging with probabilistic classifiers]]></article-title>
<source><![CDATA[International Journal of Computer Applications]]></source>
<year>2012</year>
<volume>48</volume>
<numero>17</numero>
<issue>17</issue>
<page-range>26-30</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ojha]]></surname>
<given-names><![CDATA[A.K.]]></given-names>
</name>
<name>
<surname><![CDATA[Behera]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jha]]></surname>
<given-names><![CDATA[G.N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Training &amp; evaluation of POS taggers in Indo-Aryan languages: A case of Hindi, Odia and Bhojpuri]]></source>
<year>2015</year>
<conf-name><![CDATA[ 7th Language &amp; Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>524-9</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ghosh]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Ghosh]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-speech tagging of code-mixed social media text]]></source>
<year>2016</year>
<conf-name><![CDATA[ The second workshop on computational approaches to code switching]]></conf-name>
<conf-loc> </conf-loc>
<page-range>90-7</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zeroual]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Lakhouaja]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Belahbib]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Towards a standard part of speech tagset for the Arabic language]]></article-title>
<source><![CDATA[Journal of King Saud University-Computer and Information Sciences]]></source>
<year>2017</year>
<volume>29</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>171-8</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dibitso]]></surname>
<given-names><![CDATA[M.A.]]></given-names>
</name>
<name>
<surname><![CDATA[Owolawi]]></surname>
<given-names><![CDATA[P.A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ojo]]></surname>
<given-names><![CDATA[S.O.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part of speech tagging for Setswana African language]]></source>
<year>2019</year>
<conf-name><![CDATA[ International Multidisciplinary Information Technology and Engineering Conference (IMITEC)]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lafferty]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[McCallum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pereira]]></surname>
<given-names><![CDATA[F.C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Conditional random fields: Probabilistic models for segmenting and labeling sequence data]]></source>
<year>2001</year>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lalzarzova]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mizo Tawng Grammar &amp; Composition]]></source>
<year>2016</year>
<publisher-loc><![CDATA[Aizawl, Mizoram ]]></publisher-loc>
<publisher-name><![CDATA[K. Sangzawna]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Thangzikpuia]]></surname>
<given-names><![CDATA[P.C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mizo Tawng Grammar]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lalhluna]]></surname>
<given-names><![CDATA[R.K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Cinque Foils &#8211; Zo &#7788;awng Grammar]]></source>
<year>2014</year>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<collab>Mizoram Board of School Education</collab>
<source><![CDATA[Mizo &#7788;awng Ziah dan]]></source>
<year>2020</year>
<page-range>1-6</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Khiangte]]></surname>
<given-names><![CDATA[Laltluangliana]]></given-names>
</name>
</person-group>
<source><![CDATA[Thuhlaril Aizawl: College Text Book]]></source>
<year>1997</year>
<publisher-name><![CDATA[Editorial Board Publications]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pakray]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Pal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Majumder]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Resource building and parts-of-speech (POS) tagging for the Mizo language]]></source>
<year>2015</year>
<conf-name><![CDATA[ 4th Mexican International Conference on Artificial Intelligence (MICAI)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>3-7</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nunsanga]]></surname>
<given-names><![CDATA[M.V.]]></given-names>
</name>
<name>
<surname><![CDATA[Pakray]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Lalngaihtuaha]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Singh]]></surname>
<given-names><![CDATA[L.L.K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Part-of-speech tagging in Mizo language: A preliminary study]]></article-title>
<source><![CDATA[Data Intelligence and Cognitive Informatics]]></source>
<year>2021</year>
<page-range>625-35</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
