<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442009000200002</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[TrainQA: a Training Corpus for Corpus-Based Question Answering Systems]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Tomás]]></surname>
<given-names><![CDATA[David]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Vicedo]]></surname>
<given-names><![CDATA[José L.]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Bisbal]]></surname>
<given-names><![CDATA[Empar]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Moreno]]></surname>
<given-names><![CDATA[Lidia]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,University of Alicante Department of Software and Computing Systems ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Spain</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Technical University of Valencia Department of Information Systems and Computation ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Spain</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2009</year>
</pub-date>
<numero>40</numero>
<fpage>5</fpage>
<lpage>11</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442009000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442009000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442009000200002&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[This paper describes the development of an English corpus of factoid TREC-like question-answer pairs. The corpus obtained consists of more than 70,000 samples, containing each one the following information: a question, its question type, an exact answer to the question, the different contexts levels (sentence, paragraph and document) where the answer occurs inside a document, and a label indicating whether the answer is correct (a positive sample) or not (a negative sample). For instance, TrainQA can be used for training a binary classifier in order to decide if a given answer is correct (positive) to the question formulated or not (negative). To our knowledge, this is the first corpus aimed to train on every stage of a trainable Question Answering system: question classification, information retrieval, answer extraction and answer validation.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Question answering]]></kwd>
<kwd lng="en"><![CDATA[corpus-based systems]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Special section: Information Retrieval and Natural Language Processing</font></p>     <p align="justify"><font face="verdana" size="4">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>TrainQA: a Training Corpus for Corpus&#150;Based Question Answering Systems</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>David Tom&aacute;s<sup>1</sup>, Jos&eacute; L. Vicedo<sup>1</sup>, Empar Bisbal<sup>2</sup>, and Lidia Moreno<sup>2</sup></b></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>1</sup> Department of Software and Computing Systems, University of Alicante, Spain.</i> (<a href="mailto:dtomas@dlsi.ua.es">dtomas@dlsi.ua.es</a>, <a href="mailto:vicedo@dlsi.ua.es">vicedo@dlsi.ua.es</a>)</font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>2</sup> Department of Information Systems and Computation, Technical University of Valencia, Spain.</i> (<a href="mailto:ebisbal@dsic.upv.es">ebisbal@dsic.upv.es</a>, <a href="mailto:lmoreno@dsic.upv.es">lmoreno@dsic.upv.es</a>)</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2">Manuscript received November 23, 2008.     ]]></body>
<body><![CDATA[<br> Manuscript accepted for publication August 15, 2009.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">This paper describes the development of an English corpus of factoid TREC&#150;like question&#150;answer pairs. The corpus obtained consists of more than 70,000 samples, containing each one the following information: a question, its question type, an exact answer to the question, the different contexts levels (sentence, paragraph and document) where the answer occurs inside a document, and a label indicating whether the answer is correct (a positive sample) or not (a negative sample). For instance, TrainQA can be used for training a binary classifier in order to decide if a given answer is correct (positive) to the question formulated or not (negative). To our knowledge, this is the first corpus aimed to train on every stage of a trainable Question Answering system: question classification, information retrieval, answer extraction and answer validation.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Key words: </b>Question answering, corpus&#150;based systems.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n40/n40a2.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>ACKNOWLEDGEMENT</b></font></p>     <p align="justify"><font face="verdana" size="2">This work has been developed in the framework of the project CICYT R2D2 (TIC2003&#150;07158&#150;C04).</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>REFERENCES</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; M. P. Marcus, B. Santorini, and M. A. Marcinkiewicz, "Building a large annotated corpus of english: The penn treebank," <i>Computational Linguistics, </i>vol. 19, no. 2, pp. 313&#150;330, 1994.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049763&pid=S1870-9044200900020000200001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; D. Ravichandran, A. Ittycheriah, and S. Roukos, "Automatic derivation of surface text patterns for a maximum entropy based question answering system," in <i>NAACL '03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology. </i>Morristown, NJ, USA: Association for Computational Linguistics, 2003, pp. 85&#150;87.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049765&pid=S1870-9044200900020000200002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; R. Soricut and E. Brill, "Automatic question answering using the web: Beyond the factoid," <i>Information Retrieval, </i>vol. 9, no. 2, pp. 191&#150;206, 2006.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049767&pid=S1870-9044200900020000200003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; E. Agichtein, S. Lawrence, and L. Gravano, "Learning search engine specific query transformations for question answering," in WWW <i>'01: Proceedings of the 10th international conference on World Wide Web. </i>New York, NY, USA: ACM, 2001, pp. 169&#150;178.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049769&pid=S1870-9044200900020000200004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; R. D. Burke, K. J. Hammond, V. A. Kulyukin, S. L. Lytinen, N. Tomuro, and S. Schoenberg, "Question answering from frequently asked question files: Experiences with the faq finder system," Chicago, IL, USA, Tech. Rep., 1997.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049771&pid=S1870-9044200900020000200005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; A. Berger, R. Caruana, D. Cohn, D. Freitag, and V. Mittal, "Bridging the lexical chasm: statistical approaches to answer&#150;finding," in <i>SIGIR '00: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval. </i>New York, NY, USA: ACM, 2000, pp. 192&#150;199.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049773&pid=S1870-9044200900020000200006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; E. Bisbal, D. Tomas, L. Moreno, J. L. Vicedo, and A. Suarez, "A multilingual svm&#150;based question classification system," in <i>MICAI 2005: Advances in Artificial Intelligence, 4th Mexican International Conference on Artificial Intelligence, </i>ser. Lecture Notes in Computer Science, A. F. Gelbukh, A. de Albornoz, and H. Terashima&#150;Mar&iacute;n, Eds., vol. 3789. Springer, November 2005, pp. 806&#150;815.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049775&pid=S1870-9044200900020000200007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; I. Dagan, O. Glickman, and B. Magnini, "Recognizing textual entailment," in <i>PASCAL Proceedings of the First Challenge Workshop, </i>Southampton, UK, April 2005, pp. 1&#150;8.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049777&pid=S1870-9044200900020000200008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; E. M. Voorhees, "The trec&#150;8 question answering track report," in <i>Eighth </i><i>Text REtrieval Conference, </i>ser. NIST Special Publication, vol. 500&#150;246. Gaithersburg, USA: National Institute of Standards and Technology, November 1999, pp. 77&#150;82.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049779&pid=S1870-9044200900020000200009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; J. C. Reynar and A. Ratnaparkhi, "A maximum entropy approach to identifying sentence boundaries," in <i>Proceedings of the fifth conference on Applied natural language processing. </i>Morristown, NJ, USA: Association for Computational Linguistics, 1997, pp. 16&#150;19.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049781&pid=S1870-9044200900020000200010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; J. L. Fleiss, "Measuring nominal scale agreement among many raters," <i>Psychological Bulletin, </i>vol. 76, no. 5, pp. 378&#150;382, 1971.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6049783&pid=S1870-9044200900020000200011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Marcus]]></surname>
<given-names><![CDATA[M. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Santorini]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Marcinkiewicz]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Building a large annotated corpus of english: The penn treebank]]></article-title>
<source><![CDATA[Computational Linguistics]]></source>
<year>1994</year>
<volume>19</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>313-330</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ravichandran]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Ittycheriah]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Roukos]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic derivation of surface text patterns for a maximum entropy based question answering system]]></article-title>
<source><![CDATA[]]></source>
<year></year>
<conf-name><![CDATA[ Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology]]></conf-name>
<conf-date>2003</conf-date>
<conf-loc>Morristown NJ</conf-loc>
<page-range>85-87</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Soricut]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Brill]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic question answering using the web: Beyond the factoid]]></article-title>
<source><![CDATA[Information Retrieval]]></source>
<year>2006</year>
<volume>9</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>191-206</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agichtein]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Lawrence]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Gravano]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Learning search engine specific query transformations for question answering]]></article-title>
<source><![CDATA[]]></source>
<year></year>
<conf-name><![CDATA[ WWW '01: Proceedings of the 10th international conference on World Wide Web]]></conf-name>
<conf-date>2001</conf-date>
<conf-loc>New York NY</conf-loc>
<page-range>169-178</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Burke]]></surname>
<given-names><![CDATA[R. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Hammond]]></surname>
<given-names><![CDATA[K. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Kulyukin]]></surname>
<given-names><![CDATA[V. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Lytinen]]></surname>
<given-names><![CDATA[S. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Tomuro]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Schoenberg]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Question answering from frequently asked question files: Experiences with the faq finder system]]></source>
<year>1997</year>
<publisher-loc><![CDATA[Chicago^eIL IL]]></publisher-loc>
<publisher-name><![CDATA[Tech. Rep]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Berger]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Caruana]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Cohn]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Freitag]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Mittal]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Bridging the lexical chasm: statistical approaches to answer-finding]]></article-title>
<source><![CDATA[]]></source>
<year></year>
<conf-name><![CDATA[ SIGIR '00: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval]]></conf-name>
<conf-date>2000</conf-date>
<conf-loc>New York NY</conf-loc>
<page-range>192-199</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bisbal]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Tomas]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Moreno]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Vicedo]]></surname>
<given-names><![CDATA[J. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Suarez]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A multilingual svm-based question classification system]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Albornoz]]></surname>
<given-names><![CDATA[A. de]]></given-names>
</name>
<name>
<surname><![CDATA[Terashima-Marín]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[MICAI 2005: Advances in Artificial Intelligence, 4th Mexican International Conference on Artificial Intelligence]]></source>
<year>Nove</year>
<month>mb</month>
<day>er</day>
<volume>3789</volume>
<page-range>806-815</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dagan]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Glickman]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Magnini]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Recognizing textual entailment]]></article-title>
<source><![CDATA[PASCAL Proceedings of the First Challenge Workshop]]></source>
<year>Apri</year>
<month>l </month>
<day>20</day>
<page-range>1-8</page-range><publisher-loc><![CDATA[Southampton ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Voorhees]]></surname>
<given-names><![CDATA[E. M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The trec-8 question answering track report]]></article-title>
<source><![CDATA[Eighth Text REtrieval Conference]]></source>
<year>Nove</year>
<month>mb</month>
<day>er</day>
<volume>500-246</volume>
<page-range>77-82</page-range><publisher-loc><![CDATA[Gaithersburg ]]></publisher-loc>
<publisher-name><![CDATA[National Institute of Standards and Technology]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Reynar]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Ratnaparkhi]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A maximum entropy approach to identifying sentence boundaries]]></article-title>
<source><![CDATA[Proceedings of the fifth conference on Applied natural language processing]]></source>
<year>1997</year>
<page-range>16-19</page-range><publisher-loc><![CDATA[Morristown^eNJ NJ]]></publisher-loc>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fleiss]]></surname>
<given-names><![CDATA[J. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Measuring nominal scale agreement among many raters]]></article-title>
<source><![CDATA[Psychological Bulletin]]></source>
<year>1971</year>
<volume>76</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>378-382</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
