<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462018000401287</article-id>
<article-id pub-id-type="doi">10.13053/cys-22-4-3074</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Building Resources For Vietnamese Clinical Text Processing]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Minh]]></surname>
<given-names><![CDATA[Hiep Nguyen]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Thi Minh]]></surname>
<given-names><![CDATA[Huyen Nguyen]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[The]]></surname>
<given-names><![CDATA[Quyen Ngo]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Dalat University  ]]></institution>
<addr-line><![CDATA[Da Lat ]]></addr-line>
<country>Vietnam</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,VNU University of Science  ]]></institution>
<addr-line><![CDATA[Ha Noi ]]></addr-line>
<country>Vietnam</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<volume>22</volume>
<numero>4</numero>
<fpage>1287</fpage>
<lpage>1294</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462018000401287&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462018000401287&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462018000401287&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: Clinical texts contain textual data recorded by doctors during medical examinations. Sentences in clinical texts are generally short, narrative, not strictly adhering to Vietnamese grammar and contain many medical terms which are not present in general dictionaries. In this paper, we investigate the tasks of lexical analysis and phrase chunking for Vietnamese clinical texts. Although there exist several tools for general Vietnamese text analysis, these tools showed a limited quality in the clinical domain due to the specific grammatical style of clinical texts and the lack of medical vocabulary. Our main contributions are the construction of an annotated corpus (vnEMR) and lexical resources in the medical domain and in consequence the improvement of the quality of the tools for clinical text analysis, including word segmentation, part-of-speech tagging and chunking.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Chunking]]></kwd>
<kwd lng="en"><![CDATA[clinical text]]></kwd>
<kwd lng="en"><![CDATA[collocation]]></kwd>
<kwd lng="en"><![CDATA[lexical resources]]></kwd>
<kwd lng="en"><![CDATA[medical vocabulary]]></kwd>
<kwd lng="en"><![CDATA[POS tagging]]></kwd>
<kwd lng="en"><![CDATA[vnEMR]]></kwd>
<kwd lng="en"><![CDATA[word segmentation]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hông-Phuong]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Thi Minh Huyên]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Roussanaly]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Vinh]]></surname>
<given-names><![CDATA[H.T.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A Hybrid Approach to Word Segmentation of Vietnamese Texts]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Martín-Vide]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Otto]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Fernau]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Language and Automata Theory and Applications, LATA Lecture Notes in Computer Science]]></source>
<year>2008</year>
<volume>5196</volume>
<publisher-loc><![CDATA[Berlin, Heidelberg ]]></publisher-loc>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cam-Tu]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Xuan-Hieu]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[JVnSegmenter: A Java-based Vietnamese Word Segmentation Tool]]></source>
<year>2007</year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dat Quoc Nguyen]]></surname>
<given-names><![CDATA[Dai Quoc Nguyen]]></given-names>
</name>
<name>
<surname><![CDATA[Dang Duc Pham]]></surname>
<given-names><![CDATA[&amp; Son Bao Pham]]></given-names>
</name>
</person-group>
<source><![CDATA[RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger]]></source>
<year>2014</year>
<conf-name><![CDATA[ Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>17-20</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dinh]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kiem]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Toan]]></surname>
<given-names><![CDATA[N.V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Vietnamese Word Segmentation]]></source>
<year>2001</year>
<conf-name><![CDATA[ The 6th Natural Language Processing Pacific Rim Symposium]]></conf-name>
<conf-loc> </conf-loc>
<page-range>749-56</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pham]]></surname>
<given-names><![CDATA[DD.]]></given-names>
</name>
<name>
<surname><![CDATA[Tran]]></surname>
<given-names><![CDATA[GB.]]></given-names>
</name>
<name>
<surname><![CDATA[Pham]]></surname>
<given-names><![CDATA[SB.]]></given-names>
</name>
</person-group>
<source><![CDATA[A hybrid approach to Vietnamese word segmentation using part of speech tags]]></source>
<year>2009</year>
<conf-name><![CDATA[ International Conference on Knowledge]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[P.T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[V.V.]]></given-names>
</name>
<name>
<surname><![CDATA[Le]]></surname>
<given-names><![CDATA[A.C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Vietnamese word segmentation using hidden mar-kov model]]></source>
<year>2003</year>
<conf-name><![CDATA[ International Workshop for Computer, Information, and Communication Technologies on State of the Art and Future Trends of Information techonologies in Korea and Vietnam]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[C.T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.K.]]></given-names>
</name>
<name>
<surname><![CDATA[Phan]]></surname>
<given-names><![CDATA[X.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[L.M]]></given-names>
</name>
<name>
<surname><![CDATA[Ha]]></surname>
<given-names><![CDATA[Q.T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Vietnamese word segmentation with CRFs and SVMs]]></source>
<year>2006</year>
<conf-name><![CDATA[ An investigation, Proceedings of the 20t PACLIC]]></conf-name>
<conf-loc> </conf-loc>
<page-range>215-22</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dinh]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Vu]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[A maximum entropy approach for Vietnamese word segmentation]]></source>
<year>2006</year>
<conf-name><![CDATA[ 4th RIVF VietNam]]></conf-name>
<conf-loc> </conf-loc>
<page-range>12-6</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dinh]]></surname>
<given-names><![CDATA[Q.T.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.M.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Vu]]></surname>
<given-names><![CDATA[X.L.]]></given-names>
</name>
<name>
<surname><![CDATA[Rossignol]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[C.T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Word segmentation of Vietnamese texts: a comparison of approaches]]></source>
<year>2008</year>
<conf-name><![CDATA[ The Sixth International Conference on Language Resources and Evaluation]]></conf-name>
<conf-loc>Marrakech </conf-loc>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Le]]></surname>
<given-names><![CDATA[H.P]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.M.H]]></given-names>
</name>
<name>
<surname><![CDATA[Azim]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Ho]]></surname>
<given-names><![CDATA[T.V.]]></given-names>
</name>
</person-group>
<source><![CDATA[A hybrid approach to Word Segmentation of Vietnamese texts]]></source>
<year>2008</year>
<conf-name><![CDATA[ Language and automata theory and applications 2nd international conference, LATA]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="">
<collab>VLSP project</collab>
<source><![CDATA[Vietnamese Language Processing]]></source>
<year>2012</year>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<source><![CDATA[JNLP]]></source>
<year>2010</year>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Minh]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Vu-Xuan]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[S&#7917; d&#7909;ng b&#7897; gán nhãn t&#7915; lo&#7841;i xác su&#7845;t QTAG cho v&#259;n b&#7843;n ti&#7871;ng Vi&#7879;t]]></source>
<year>2003</year>
<conf-name><![CDATA[ K&#7927; y&#7871;u h&#7897;i th&#7843;o ICT.rda&#8217;03]]></conf-name>
<conf-loc>Hà N&#7897;i </conf-loc>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T. M. H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-Speech Induction for Vietnamese]]></source>
<year>2013</year>
<volume>2</volume>
<conf-name><![CDATA[ The Fifth International Conference on Knowledge and Systems Engineering (KSE&#8217;13)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>261-72</page-range><publisher-name><![CDATA[Springer-Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[P.T.]]></given-names>
</name>
<name>
<surname><![CDATA[Xuan]]></surname>
<given-names><![CDATA[L.V.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.M.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[V.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Building a Large Syntactically-Annotated Corpus of Vietnamese]]></source>
<year>2009</year>
<conf-name><![CDATA[ 3rd Linguistic Annotation Workshop]]></conf-name>
<conf-loc>Singapore </conf-loc>
<page-range>182-5</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.M.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Rossingnol]]></surname>
<given-names><![CDATA[M.A.]]></given-names>
</name>
</person-group>
<source><![CDATA[An empirical study of maximum entropy approach for part-of-speech tagging of Vietnamese texts]]></source>
<year>2010</year>
<conf-name><![CDATA[ Actes du Traitement Automatique des Langues Naturelles (TALN&#8217;10)]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[Minh Hiep]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[Thi Minh Huyen]]></given-names>
</name>
<name>
<surname><![CDATA[Ngo]]></surname>
<given-names><![CDATA[The Quyen.]]></given-names>
</name>
</person-group>
<source><![CDATA[Nghiên c&#7913;u v&#7873; t&#7853;p t&#7915; lo&#7841;i ti&#7871;ng Vi&#7879;t s&#7917; d&#7909;ng k&#297; thu&#7853;t phân c&#7909;m]]></source>
<year>2016</year>
<conf-name><![CDATA[ K&#7927; y&#7871;u c&#7911;a H&#7897;i th&#7843;o Qu&#7889;c gia v&#7873; CNTT&amp;TT l&#7847;n th&#7913; 18]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[M.L.]]></given-names>
</name>
<name>
<surname><![CDATA[Cao]]></surname>
<given-names><![CDATA[T.H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Constructing a Vietnamese Chunking System]]></source>
<year>2008</year>
<conf-name><![CDATA[ The 4rd National Symposium on Research, Development and Application of Information and Communication Technology (ICTrda&#8217;08)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>249-57</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen Huong Thao]]></surname>
<given-names><![CDATA[Nguyen Phuong Thai]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen Le Minh]]></surname>
<given-names><![CDATA[&amp; Ha Quang Thuy]]></given-names>
</name>
</person-group>
<source><![CDATA[Vietnamese Noun Phrase Chunking based on Conditional Random Fields]]></source>
<year>2009</year>
<conf-name><![CDATA[ The first International Conference on Knowledge and Systems Engineering (KSE&#8217;09)]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brill]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging]]></article-title>
<source><![CDATA[Journal of Computational Linguistics]]></source>
<year>1995</year>
<volume>21</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>543-65</page-range></nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen Thi Minh]]></surname>
<given-names><![CDATA[Huyen]]></given-names>
</name>
<name>
<surname><![CDATA[Vu Xuan]]></surname>
<given-names><![CDATA[Luong]]></given-names>
</name>
<name>
<surname><![CDATA[Le Hong]]></surname>
<given-names><![CDATA[Phuong]]></given-names>
</name>
</person-group>
<source><![CDATA[S&#7917; d&#7909;ng b&#7897; gán nhãn t&#7915; lo&#7841;i xác su&#7845;t QTAG cho v&#259;n b&#7843;n ti&#7871;ng Vi&#7879;t]]></source>
<year>2003</year>
<conf-name><![CDATA[ K&#7927; y&#7871;u h&#7897;i th&#7843;o ICT.rda&#8217;03]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[NGUYEN]]></surname>
<given-names><![CDATA[T.M.H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-Speech Induction for Vietnamese]]></source>
<year>2013</year>
<volume>2</volume>
<conf-name><![CDATA[ The Fifth International Conference on Knowledge and Systems Engineering (KSE 2013)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>261-72</page-range><publisher-name><![CDATA[Springer-Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[P.T.]]></given-names>
</name>
<name>
<surname><![CDATA[Xuan]]></surname>
<given-names><![CDATA[L.V.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[T.M.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nguyen]]></surname>
<given-names><![CDATA[V.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Le-Hong]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Building a Large Syntactically-Annotated Corpus of Vietnamese]]></source>
<year>2009</year>
<conf-name><![CDATA[ 3rd Linguistic Annotation Workshop]]></conf-name>
<conf-loc>Singapore </conf-loc>
<page-range>182-5</page-range></nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[P.F.]]></given-names>
</name>
<name>
<surname><![CDATA[Della-Pietra]]></surname>
<given-names><![CDATA[V.J.]]></given-names>
</name>
<name>
<surname><![CDATA[Desouza]]></surname>
<given-names><![CDATA[P.V.]]></given-names>
</name>
<name>
<surname><![CDATA[Lai]]></surname>
<given-names><![CDATA[J.C.]]></given-names>
</name>
<name>
<surname><![CDATA[Mercer]]></surname>
<given-names><![CDATA[R.L.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Classbased n-gram models of natural language]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1992</year>
<volume>18</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>467-79</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Clark]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Combining distributional and morphological information for part of speech induction]]></source>
<year>2003</year>
<conf-name><![CDATA[ Proceedings of (EACL&#8217;03)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>59-66</page-range></nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jianfeng]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Joshua]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Goodman-Jiangbo]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Use of Clustering Techniques for Language Modeling - Application to Asian Languages]]></source>
<year>2005</year>
<publisher-loc><![CDATA[China ]]></publisher-loc>
<publisher-name><![CDATA[Microsoft Research]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Biemann]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
</person-group>
<source><![CDATA[Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering]]></source>
<year>2011</year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
