<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442013000200009</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[A POS Tagger for Social Media Texts Trained on Web Comments]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Neunerdt]]></surname>
<given-names><![CDATA[Melanie]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Reyer]]></surname>
<given-names><![CDATA[Michael]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Mathar]]></surname>
<given-names><![CDATA[Rudolf]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Rheinisch-Westfaelische Technische Hochschule Aachen University Institute for Theoretical Information Technology ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Germany</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2013</year>
</pub-date>
<numero>48</numero>
<fpage>61</fpage>
<lpage>68</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442013000200009&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442013000200009&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442013000200009&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Using social media tools such as blogs and forums have become more and more popular in recent years. Hence, a huge collection of social media texts from different communities is available for accessing user opinions, e.g., for marketing studies or acceptance research. Typically, methods from Natural Language Processing are applied to social media texts to automatically recognize user opinions. A fundamental component of the linguistic pipeline in Natural Language Processing is Part-of-Speech tagging. Most state-of-the-art Part-of-Speech taggers are trained on newspaper corpora, which differ in many ways from non-standardized social media text. Hence, applying common taggers to such texts results in performance degradation. In this paper, we present extensions to a basic Markov model tagger for the annotation of social media texts. Considering the German standard Stuttgart/Tübinger TagSet (STTS), we distinguish 54 tag classes. Applying our approach improves the tagging accuracy for social media texts considerably, when we train our model on a combination of annotated texts from newspapers and Web comments.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Natural language processing]]></kwd>
<kwd lng="en"><![CDATA[part-of-speech tagging]]></kwd>
<kwd lng="en"><![CDATA[opinion mining]]></kwd>
<kwd lng="en"><![CDATA[German]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[   	    <p align="center"><font face="verdana" size="4"><b>A POS Tagger for Social Media Texts Trained on Web Comments</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Melanie Neunerdt, Michael Reyer, and Rudolf Mathar</b></font></p>  	    <p align="justify"><font face="verdana" size="2"><i>    <br> 	The authors are with the Institute for Theoretical Information Technology, RWTH Aachen University, Germany (e&#45;mail: </i><a href="mailto:neunerdt@ti.rwth-aachen.de">neunerdt@ti.rwth&#45;aachen.de</a>, <a href="mailto:reyer@ti.rwth-aachen.de">reyer@ti.rwth&#45;aachen.de</a>, <a href="mailto:mathar@ti.rwth-aachen.de">mathar@ti.rwth&#45;aachen.de</a><i>).</i></font></p>      <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2">Manuscript received on August 2, 2013.    <br> 	Accepted for publication on September 30, 2013.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Using social media tools such as blogs and forums have become more and more popular in recent years. Hence, a huge collection of social media texts from different communities is available for accessing user opinions, e.g., for marketing studies or acceptance research. Typically, methods from <i>Natural Language Processing</i> are applied to social media texts to automatically recognize user opinions. A fundamental component of the linguistic pipeline in <i>Natural Language Processing</i> is <i>Part&#45;of&#45;Speech</i> tagging. Most state&#45;of&#45;the&#45;art <i>Part&#45;of&#45;Speech</i> taggers are trained on newspaper corpora, which differ in many ways from non&#45;standardized social media text. Hence, applying common taggers to such texts results in performance degradation. In this paper, we present extensions to a basic Markov model tagger for the annotation of social media texts. Considering the German standard <i>Stuttgart/T&uuml;binger TagSet</i> (STTS), we distinguish 54 tag classes. Applying our approach improves the tagging accuracy for social media texts considerably, when we train our model on a combination of annotated texts from newspapers and Web comments.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Key words:</b> Natural language processing, part&#45;of&#45;speech tagging, opinion mining, German.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font><font face="verdana" size="2">&nbsp;</font></p>         <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n48/n48a9.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>         <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Acknowledgment</b></font></p>  	    <p align="justify"><font face="verdana" size="2">This work was partially supported by the Project House HumTec at RWTH Aachen University, Germany. We would like to thank Phillip VaBen for his contribution.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; K. Toutanova, D. Klein, C. D. Manning, and Y. Singer, "Feature&#45;rich Part&#45;of&#45;Speech Tagging With a Cyclic Dependency Network," in <i>Proceedings of Human Language Technology Conference,</i> 2003, pp. 173&#45;180.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050945&pid=S1870-9044201300020000900001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; P. Gadde, L. V. Subramaniam, and T. A. Faruquie, "Adapting a WSJ Trained Part&#45;of&#45;Speech Tagger to Noisy Text: Preliminary Results," in <i>Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data,</i> 2011, pp. 5:1&#45;5:8.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050947&pid=S1870-9044201300020000900002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; H. Schmid, "Probabilistic Part&#45;of&#45;Speech Tagging Using Decision Trees," in <i>Proceedings of International Conference on New Methods in Language Processing,</i> 1994, pp. 44&#45;49.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050949&pid=S1870-9044201300020000900003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; &#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;, "Improvements in Part&#45;of&#45;Speech Tagging With an Application to German," in <i>Proceedings of the ACL SIGDAT&#45;Workshop,</i> 1995, pp. 47&#45;50.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050951&pid=S1870-9044201300020000900004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; T. Brants, "TnT &#45; A Statistical Part&#45;of&#45;Speech Tagger," in <i>Proceedings of the 6th Applied Natural Language Processing Conference,</i> 2000, pp. 224&#45;231.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050953&pid=S1870-9044201300020000900005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; A. Schiller, S. Teufel, C. St&oacute;ckert, and C. Thielen, "Guidelines f&uuml;r das Tagging deutscher Textcorpora mit STTS," 1999, university of Stuttgart.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050955&pid=S1870-9044201300020000900006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; J. Gim&eacute;nez and L. M&aacute;rquez, "Svmtool: A General POS Tagger Generator Based on Support Vector Machines," in <i>Proceedings of the 4th International Conference on Language Resources and Evaluation,</i> 2004, pp. 43-46.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050957&pid=S1870-9044201300020000900007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; H. Schmid, "Part&#45;of&#45;Speech Tagging With Neural Networks," in <i>Proceedings of the 15th Conference on Computational Linguistics,</i> 1994, pp. 172&#45;176.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050959&pid=S1870-9044201300020000900008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; M. Volk and G. Schneider, "Comparing a statistical and a rule&#45;based tagger for German," in <i>Proceedings of the 4th Conference on Natural Language Processing,</i> 1998, pp. 125&#45;137.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050961&pid=S1870-9044201300020000900009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; E. Giesbrecht and S. Evert, "Is Part&#45;of&#45;Speech Tagging a Solved Task? An Evaluation of POS Taggers for the German Web as Corpus," in <i>Proceedings of the Fifth Web as Corpus Workshop,</i> 2009, pp. 27&#45;35.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050963&pid=S1870-9044201300020000900010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; A. Mikheev, "Automatic Rule Induction for Unknown Word Guessing," <i>Computational Linguistics,</i> vol. 23, pp. 405&#45;423, 1997.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050965&pid=S1870-9044201300020000900011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; H. Schtitze, "Distributional Part&#45;of&#45;Speech Tagging," in <i>Proceedings of 7th Conference of the European Chapter of the Association for Computational Linguistics,</i> 1995, pp. 141&#45;148.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050967&pid=S1870-9044201300020000900012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; O. Owoputi, B. O'Connor, C. Dyer, K. Gimpel, and N. Schneider, "Part&#45;of&#45;Speech Tagging for Twitter: Word Clusters and Other Advances," School of Computer Science, Carnegie Mellon University, Tech. Rep., 2012.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050969&pid=S1870-9044201300020000900013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flanigan, and N. A. Smith, "Part&#45;of&#45;Speech tagging for Twitter: annotation, features, and experiments," in <i>Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics,</i> 2011, pp. 42&#45;47.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050971&pid=S1870-9044201300020000900014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; M. Neunerdt, M. Reyer, and R. Mathar, "Part&#45;of&#45;Speech Tagging for Social Media Texts," in <i>Proceedings of The International Conference of the German Society for Computational Linguistics and Language Technology,</i> 2013.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050973&pid=S1870-9044201300020000900015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; B. Trevisan, M. Neunerdt, and E.&#45;M. Jakobs, "A Multi&#45;level Annotation Model for Fine&#45;grained Opinion Detection in German Blog Comments," in <i>Proceedings of KONVENS 2012,</i> 2012, pp. 179&#45;188.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050975&pid=S1870-9044201300020000900016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; M. BeiBwenger, M. Ermakova, A. Geyken, L. Lemnitzer, and A. Storrer, "A TEI Schema for the Representation of Computer&#45;mediated Communication," <i>Journal of the Text Encoding Initiative,</i> no. 3, pp. 1&#45;31, 2012. &#91;Online&#93;. Available: <a href="http://jtei.revues.org/476" target="_blank">http://jtei.revues.org/476</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050977&pid=S1870-9044201300020000900017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; J. R. Quinlan, "Induction of Decision Trees," <i>Machine Learning,</i> pp. 81&#45;106, 1986.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050978&pid=S1870-9044201300020000900018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; S. Brants, S. Dipper, P. Eisenberg, S. Hansen&#45;Schirra, E. K&oacute;nig, W. Lezius, C. Rohrer, G. Smith, and H. Uszkoreit, "TIGER: Linguistic Interpretation of a German Corpus," <i>Research on Language &amp; Computation,</i> pp. 597&#45;620, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050980&pid=S1870-9044201300020000900019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; M. BeiBwenger, "Corpora zur computervermittelten (internetbasierten) Kommunikation," <i>Zeitschrift f&uuml;r germanistische Linguistik,</i> vol. 35, pp. 496&#45;503, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6050982&pid=S1870-9044201300020000900020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Toutanova]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Klein]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Singer]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Feature-rich Part-of-Speech Tagging With a Cyclic Dependency Network]]></article-title>
<source><![CDATA[Proceedings of Human Language Technology Conference]]></source>
<year>2003</year>
<page-range>173-180</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gadde]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Subramaniam]]></surname>
<given-names><![CDATA[L. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Faruquie]]></surname>
<given-names><![CDATA[T. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Adapting a WSJ Trained Part-of-Speech Tagger to Noisy Text: Preliminary Results]]></article-title>
<source><![CDATA[Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data]]></source>
<year>2011</year>
<page-range>5:1-5:8</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schmid]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Probabilistic Part-of-Speech Tagging Using Decision Trees]]></article-title>
<source><![CDATA[Proceedings of International Conference on New Methods in Language Processing]]></source>
<year>1994</year>
<page-range>44-49</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schmid]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Improvements in Part-of-Speech Tagging With an Application to German]]></article-title>
<source><![CDATA[Proceedings of the ACL SIGDAT-Workshop]]></source>
<year>1995</year>
<page-range>47-50</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brants]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[TnT - A Statistical Part-of-Speech Tagger]]></article-title>
<source><![CDATA[Proceedings of the 6th Applied Natural Language Processing Conference]]></source>
<year>2000</year>
<page-range>224-231</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schiller]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Teufel]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Stóckert]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Thielen]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Guidelines für das Tagging deutscher Textcorpora mit STTS]]></source>
<year>1999</year>
<publisher-name><![CDATA[university of Stuttgart]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Giménez]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Márquez]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Svmtool: A General POS Tagger Generator Based on Support Vector Machines]]></article-title>
<source><![CDATA[Proceedings of the 4th International Conference on Language Resources and Evaluation]]></source>
<year>2004</year>
<page-range>43-46</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schmid]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Part-of-Speech Tagging With Neural Networks]]></article-title>
<source><![CDATA[Proceedings of the 15th Conference on Computational Linguistics]]></source>
<year>1994</year>
<page-range>172-176</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Volk]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Schneider]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Comparing a statistical and a rule-based tagger for German]]></article-title>
<source><![CDATA[Proceedings of the 4th Conference on Natural Language Processing]]></source>
<year>1998</year>
<page-range>125-137</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Giesbrecht]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Evert]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Is Part-of-Speech Tagging a Solved Task? An Evaluation of POS Taggers for the German Web as Corpus]]></article-title>
<source><![CDATA[Proceedings of the Fifth Web as Corpus Workshop]]></source>
<year>2009</year>
<page-range>27-35</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mikheev]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic Rule Induction for Unknown Word Guessing]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1997</year>
<volume>23</volume>
<page-range>405-423</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schtitze]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Distributional Part-of-Speech Tagging]]></article-title>
<source><![CDATA[Proceedings of 7th Conference of the European Chapter of the Association for Computational Linguistics]]></source>
<year>1995</year>
<page-range>141-148</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Owoputi]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[O'Connor]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Dyer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gimpel]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Schneider]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Part-of-Speech Tagging for Twitter: Word Clusters and Other Advances]]></source>
<year>2012</year>
<publisher-name><![CDATA[School of Computer Science, Carnegie Mellon University]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gimpel]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Schneider]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[O'Connor]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Mills]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Eisenstein]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Heilman]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Yogatama]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Flanigan]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[N. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Part-of-Speech tagging for Twitter: annotation, features, and experiments]]></article-title>
<source><![CDATA[Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics]]></source>
<year>2011</year>
<page-range>42-47</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Neunerdt]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Reyer]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Mathar]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Part-of-Speech Tagging for Social Media Texts]]></article-title>
<source><![CDATA[Proceedings of The International Conference of the German Society for Computational Linguistics and Language Technology]]></source>
<year>2013</year>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Trevisan]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Neunerdt]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Jakobs]]></surname>
<given-names><![CDATA[E.-M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A Multi-level Annotation Model for Fine-grained Opinion Detection in German Blog Comments]]></article-title>
<source><![CDATA[Proceedings of KONVENS 2012]]></source>
<year>2012</year>
<page-range>179-188</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[BeiBwenger]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ermakova]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Geyken]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Lemnitzer]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Storrer]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A TEI Schema for the Representation of Computer-mediated Communication]]></article-title>
<source><![CDATA[Journal of the Text Encoding Initiative]]></source>
<year>2012</year>
<numero>3</numero>
<issue>3</issue>
<page-range>1-31</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Quinlan]]></surname>
<given-names><![CDATA[J. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Induction of Decision Trees]]></article-title>
<source><![CDATA[Machine Learning]]></source>
<year>1986</year>
<page-range>81-106</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brants]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Dipper]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Eisenberg]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Hansen-Schirra]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Kónig]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Lezius]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Rohrer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Uszkoreit]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[TIGER: Linguistic Interpretation of a German Corpus]]></article-title>
<source><![CDATA[Research on Language & Computation]]></source>
<year>2004</year>
<page-range>597-620</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[BeiBwenger]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="de"><![CDATA[Corpora zur computervermittelten (internetbasierten) Kommunikation]]></article-title>
<source><![CDATA[Zeitschrift für germanistische Linguistik]]></source>
<year>2007</year>
<volume>35</volume>
<page-range>496-503</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
