<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462016000300405</article-id>
<article-id pub-id-type="doi">10.13053/cys-20-3-2453</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Social Media - Processing Romanian Chat and Discourse Analysis]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>M&#259;r&#259;nduc</surname>
<given-names>C&#259;t&#259;lina</given-names>
</name>
<xref ref-type="aff" rid="Af1"/>
<xref ref-type="aff" rid="Af2"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Perez]]></surname>
<given-names><![CDATA[Cenel-Augusto]]></given-names>
</name>
<xref ref-type="aff" rid="Af1"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Simionescu]]></surname>
<given-names><![CDATA[Radu]]></given-names>
</name>
<xref ref-type="aff" rid="Af1"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Alexandru Ioan Cuza University, Faculty of Computer Science]]></institution>
<addr-line>Ia&#537;i</addr-line>
<country>Romania</country>
</aff>
<aff id="Af2">
<institution><![CDATA[Academic Institute of Linguistics Iorgu Iordan]]></institution>
<addr-line><![CDATA[Bucharest]]></addr-line>
<country>Romania</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>09</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>09</month>
<year>2016</year>
</pub-date>
<volume>20</volume>
<numero>3</numero>
<fpage>405</fpage>
<lpage>414</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462016000300405&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462016000300405&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462016000300405&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[To obtain a balanced corpus, a sub-corpus of 2,576 sentences illustrating contemporary social media language has been added to the Dependency Treebank for Romanian. The texts were taken from chat conversations. This paper describes the second step of processing non-standard texts with a hybrid POS tagger for Romanian and with MaltParser, both of which had previously been trained on standard language and on other styles of communication. The results show that the UAIC tools are comparable with tools for other languages trained on similar corpora. A further aim is to develop the Dependency Treebank for Romanian not only quantitatively, doubling its size in a year, but also by converting it to a new format compatible with similar corpora for other languages and by adding new, more complex annotation layers. A semantic layer and a discourse annotation layer will be added, permitting the study of discursive and conversational particularities. Finally, examples illustrating discursive particularities of chat communication are discussed.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Conversational particularities]]></kwd>
<kwd lng="en"><![CDATA[dependency treebank]]></kwd>
<kwd lng="en"><![CDATA[discourse analysis]]></kwd>
<kwd lng="en"><![CDATA[processing non-standard texts]]></kwd>
<kwd lng="en"><![CDATA[social-media communication]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Developing a Part-of-Speech Tagger for Dutch Tweets]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Avontuur]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Balemans]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Elshof]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[van Noord]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[van Zaanen]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational Linguistics in the Netherlands Journal]]></source>
<year>2012</year>
<volume>2</volume>
<page-range>34-51</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cristea]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Motivations and Implications of Veins Theory]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Sharp]]></surname>
<given-names><![CDATA[Bernadette]]></given-names>
</name>
</person-group>
<source><![CDATA[Natural Language Understanding and Cognitive Science]]></source>
<year>2005</year>
<conf-name><![CDATA[2nd International Workshop on Natural Language Understanding and Cognitive Science]]></conf-name>
<conf-loc>Miami, U.S.A.</conf-loc>
<page-range>32-44</page-range><publisher-loc><![CDATA[Portugal]]></publisher-loc>
<publisher-name><![CDATA[INSTICC Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cristea]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Right Frontier Constraint Holds Unconditionally]]></source>
<year>2005</year>
<conf-name><![CDATA[Multidisciplinary Approaches to Discourse (MAD'05)]]></conf-name>
<conf-loc>Berlin, Germany</conf-loc>
</nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Parsing the Twitterverse]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dent]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Alto]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Diep]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<source><![CDATA[Scientific American]]></source>
<year>2011</year>
<volume>305</volume>
<page-range>22</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Darling]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Paul]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Song]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<source><![CDATA[Unsupervised Part-of-Speech Tagging in Noisy and Esoteric Domains with a Syntactic-Semantic Bayesian HMM]]></source>
<year>2012</year>
<conf-name><![CDATA[Conference of the European Chapter of the Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-9</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Derczynski]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Maynard]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Aswani]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Bontcheva]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Microblog-Genre Noise and Impact on Semantic Annotation Accuracy]]></source>
<year>2013</year>
<conf-name><![CDATA[24th ACM Conference on Hypertext and Social Media]]></conf-name>
<conf-loc> </conf-loc>
<page-range>21-30</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Foster]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Cetinoglu]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Wagner]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Le Roux]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Hogan]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Nivre]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Hogan]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[van Genabith]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[POS Tagging and Parsing the Twitterverse]]></source>
<year>2011</year>
<conf-name><![CDATA[AAAI Workshop on Analyzing Microtext]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gadde]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Subramaniam]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Faruquie]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Adapting a WSJ Trained Part-of-Speech Tagger to Noisy Text: Preliminary Results]]></source>
<year>2011</year>
<conf-name><![CDATA[Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gimpel]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Schneider]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[O'Connor]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Mills]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Eisenstein]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Heilman]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Yogatama]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Flanigan]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments]]></source>
<year>2011</year>
<conf-name><![CDATA[49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies]]></conf-name>
<conf-loc> </conf-loc>
<page-range>42-47</page-range><publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Weng]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
</person-group>
<source><![CDATA[A Broad Coverage Normalization System for Social Media Language]]></source>
<year>2012</year>
<volume>1</volume>
<conf-name><![CDATA[50th Annual Meeting of the Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1035-1044</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Neunerdt]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Trevisan]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Reyer]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Mathar]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Gurevich]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Biemann]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Zesch]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part of Speech Tagging for Social Media Texts]]></source>
<year>2013</year>
<page-range>139-150</page-range><publisher-loc><![CDATA[Berlin]]></publisher-loc>
<publisher-name><![CDATA[Springer Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A POS Tagger for Social Media Texts trained on Web Comments]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Neunerdt]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Reyer]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Mathar]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Polibits]]></source>
<year>2013</year>
<volume>48</volume>
<page-range>59-66</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nivre]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Hall]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Nilsson]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[MaltParser: A Data-Driven Parser-Generator for Dependency Parsing]]></source>
<year>2006</year>
<conf-name><![CDATA[Fifth International Conference on Language Resources and Evaluation]]></conf-name>
<conf-date>2006</conf-date>
<conf-loc>Genoa, Italy</conf-loc>
<page-range>2216-2219</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Owoputi]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[O'Connor]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Dyer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gimpel]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Schneider]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-Speech Tagging for Twitter: Word Clusters and Other Advances]]></source>
<year>2012</year>
<publisher-name><![CDATA[Machine Learning Department]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Perez]]></surname>
<given-names><![CDATA[C.A.]]></given-names>
</name>
<name>
<surname>M&#259;r&#259;nduc</surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Simionescu]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname>Trandab&#259;&#355;</surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Gîfu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Including Social Media - a Very Dynamic Style, in the Corpora for Processing Romanian Language]]></source>
<year>2016</year>
<conf-name><![CDATA[EUROLAN 2015]]></conf-name>
<conf-loc> </conf-loc>
<page-range>139-153</page-range><publisher-loc><![CDATA[Switzerland]]></publisher-loc>
<publisher-name><![CDATA[Springer International Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="book">
<article-title xml:lang=""><![CDATA[Hybrid POS Tagger]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Simionescu]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Workshop on Language Resources and Tools in Industrial Applications]]></source>
<year>2011</year>
<publisher-name><![CDATA[Eurolan]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Part of Speech Tagging in Manipuri: A Rule-based Approach]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Singha]]></surname>
<given-names><![CDATA[K.R.]]></given-names>
</name>
<name>
<surname><![CDATA[Purkayastha]]></surname>
<given-names><![CDATA[B.S.]]></given-names>
</name>
<name>
<surname><![CDATA[Singha]]></surname>
<given-names><![CDATA[K.D.]]></given-names>
</name>
</person-group>
<source><![CDATA[International Journal of Computer Applications]]></source>
<year>2012</year>
<volume>51</volume>
<numero>14</numero>
<issue>14</issue>
</nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Toutanova]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Klein]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Singer]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network]]></source>
<year>2003</year>
<conf-name><![CDATA[Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology]]></conf-name>
<conf-loc> </conf-loc>
<page-range>173-180</page-range><publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
