<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462022000301333</article-id>
<article-id pub-id-type="doi">10.13053/cys-26-3-4355</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Towards an Automatic Mark-up of Rhetorical Structure in Student Essays]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Bick]]></surname>
<given-names><![CDATA[Eckhard]]></given-names>
</name>
<xref ref-type="aff" rid="Af1"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[University of Southern Denmark, Institute of Language and Communication]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Denmark</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>09</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>09</month>
<year>2022</year>
</pub-date>
<volume>26</volume>
<numero>3</numero>
<fpage>1333</fpage>
<lpage>1342</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462022000301333&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462022000301333&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462022000301333&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[This paper presents and discusses a discourse relation annotation scheme for the MUCH corpus of academic writing, based on Rhetorical Structure Theory (RST). The proposed set of relational tags takes into account distinctiveness, pedagogical needs, and implementability with automatic rules. We show how a pilot grammar with 180 rules can map discourse relations between existing syntactic nodes, exploiting lower-level grammatical/treebank markup and surface clues such as connectives (e.g., conjunctions and prepositions). In an evaluation of a live run on student essays from teacher training courses, the average false positive rate across the 21 most frequent categories was 26.7% for tags and 17.1% for relation links. Performance was best for categories with a high percentage of rules using surface connectives and, for in-sentence relations, their corresponding dependency links.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Rhetorical structure theory]]></kwd>
<kwd lng="en"><![CDATA[discourse annotation]]></kwd>
<kwd lng="en"><![CDATA[student essays]]></kwd>
<kwd lng="en"><![CDATA[MUCH corpus]]></kwd>
<kwd lng="en"><![CDATA[constraint grammar]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ädel]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Metadiscourse in L1 and L2]]></article-title>
<source><![CDATA[Studies in Corpus Linguistics]]></source>
<year>2006</year>
<volume>24</volume>
<publisher-name><![CDATA[John Benjamins Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bick]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Towards a semantic annotation of English television news - building and evaluating a constraint grammar FrameNet]]></source>
<year>2012</year>
<conf-name><![CDATA[26th Pacific Asia Conference on Language, Information and Computation]]></conf-name>
<conf-loc> </conf-loc>
<page-range>60-9</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bick]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Didriksen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[CG-3 - Beyond classical constraint grammar]]></source>
<year>2015</year>
<conf-name><![CDATA[20th Nordic Conference of Computational Linguistics (NODALIDA)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>31-9</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bick]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[DanProof: Pedagogical spell and grammar checking for Danish]]></source>
<year>2015</year>
<conf-name><![CDATA[International Conference Recent Advances in Natural Language Processing (RANLP)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>55-62</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carlson]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Marcu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Discourse tagging reference manual]]></article-title>
<source><![CDATA[ISI Technical Report ISI-TR-545]]></source>
<year>2001</year>
<volume>54</volume>
<publisher-name><![CDATA[Information Science Institute]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Da Cunha]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Torres-Moreno]]></surname>
<given-names><![CDATA[J. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Sierra]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[On the development of the RST Spanish treebank]]></source>
<year>2011</year>
<conf-name><![CDATA[5th Linguistic Annotation Workshop]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-10</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Eriksson]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Finnegan]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kauppinen]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiktorsson]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Wärnsby]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Withers]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[MUCH: The Malmö University-Chalmers Corpus of academic writing as a process]]></source>
<year>2012</year>
<conf-name><![CDATA[10th Teaching and Language Corpora Conference (TALC10)]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Flowerdew]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Using corpora for writing instruction]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[O'Keeffe]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[McCarthy]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Routledge Handbook of Corpus Linguistics]]></source>
<year>2010</year>
<page-range>444-57</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Forbes-Riley]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Litman]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Extracting PDTB discourse relations from student essays]]></source>
<year>2016</year>
<conf-name><![CDATA[17th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>117-27</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Iruskieta]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Aranzabe]]></surname>
<given-names><![CDATA[M. J.]]></given-names>
</name>
<name>
<surname><![CDATA[de Ilarraza]]></surname>
<given-names><![CDATA[A. D.]]></given-names>
</name>
<name>
<surname><![CDATA[González-Dios]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Lersundi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[de Lacalle]]></surname>
<given-names><![CDATA[O. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[The RST Basque treebank: an online search interface to check rhetorical relations]]></source>
<year>2013</year>
<conf-name><![CDATA[4th Workshop RST and Discourse Studies]]></conf-name>
<conf-loc> </conf-loc>
<page-range>40-9</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Karlsson]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Voutilainen]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Heikkilä]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Anttila]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Constraint grammar: A language-independent system for parsing unrestricted text]]></source>
<year>1995</year>
<page-range>1-88</page-range><publisher-name><![CDATA[Mouton de Gruyter]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mann]]></surname>
<given-names><![CDATA[W. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Thompson]]></surname>
<given-names><![CDATA[S. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Rhetorical structure theory: Toward a functional theory of text organization]]></article-title>
<source><![CDATA[TEXT - Interdisciplinary Journal for the Study of Discourse]]></source>
<year>1988</year>
<volume>8</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>243-81</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Marcu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Amorrortu]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Romera]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Experiments in constructing a corpus of discourse trees]]></source>
<year>1999</year>
<conf-name><![CDATA[ACL Workshop on Standards and Tools for Discourse Tagging]]></conf-name>
<conf-loc> </conf-loc>
<page-range>48-57</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pardo]]></surname>
<given-names><![CDATA[T. A. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Nunes]]></surname>
<given-names><![CDATA[G. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Rino]]></surname>
<given-names><![CDATA[L. H. M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[DiZer: An automatic discourse analyzer for Brazilian Portuguese]]></article-title>
<source><![CDATA[Advances in Artificial Intelligence - SBIA, Lecture Notes in Computer Science]]></source>
<year>2004</year>
<volume>3171</volume>
<page-range>224-34</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<collab>The PDTB Research Group</collab>
<article-title xml:lang=""><![CDATA[The Penn discourse treebank 2.0 annotation manual]]></article-title>
<source><![CDATA[Technical Report IRCS-08-01]]></source>
<year>2008</year>
<publisher-name><![CDATA[Institute for Research in Cognitive Science, University of Pennsylvania]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pitler]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Raghupathy]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Mehta]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nenkova]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[A. K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Easily identifiable discourse relations]]></article-title>
<source><![CDATA[Technical Reports (CIS)]]></source>
<year>2008</year>
<publisher-name><![CDATA[Institute for Research in Cognitive Science, University of Pennsylvania]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Prasad]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Dinesh]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Miltsakaki]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Robaldo]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[A. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Webber]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Penn discourse treebank 2.0]]></source>
<year>2008</year>
<conf-name><![CDATA[6th International Conference on Language Resources and Evaluation (LREC)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>2961-8</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Stede]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Potsdam commentary corpus]]></source>
<year>2004</year>
<conf-name><![CDATA[Workshop on Discourse Annotation]]></conf-name>
<conf-loc> </conf-loc>
<page-range>96-102</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Taboada]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Renkema]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Discourse relations reference corpus [Corpus]]]></source>
<year>2008</year>
<publisher-name><![CDATA[Simon Fraser University and Tilburg University]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wärnsby]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Kauppinen]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Eriksson]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiktorsson]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bick]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Olsson]]></surname>
<given-names><![CDATA[L.-J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Building interdisciplinary bridges - MUCH: The Malmö University-Chalmers Corpus of academic writing as a process]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Timofeeva]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Gardner]]></surname>
<given-names><![CDATA[A. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Honkapohja]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Chevalier]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[New Approaches to English Linguistics: Building Bridges]]></source>
<year>2016</year>
<page-range>197-211</page-range><publisher-name><![CDATA[John Benjamins Publishing]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
