<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462009000300004</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Using Machine Learning for Extracting Information from Natural Disaster News Reports]]></article-title>
<article-title xml:lang="es"><![CDATA[Usando Aprendizaje Automático para Extraer Información de Noticias de Desastres Naturales]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Téllez Valero]]></surname>
<given-names><![CDATA[Alberto]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Montes y Gómez]]></surname>
<given-names><![CDATA[Manuel]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Villaseñor Pineda]]></surname>
<given-names><![CDATA[Luis]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE) Coordinación de Ciencias Computacionales Laboratorio de Tecnologías del Lenguaje]]></institution>
<addr-line><![CDATA[Tonantzintla Puebla]]></addr-line>
<country>México</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>09</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>09</month>
<year>2009</year>
</pub-date>
<volume>13</volume>
<numero>1</numero>
<fpage>33</fpage>
<lpage>44</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462009000300004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462009000300004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462009000300004&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[The disasters caused by natural phenomena have been present all along human history; nevertheless, their consequences are greater each time. This tendency will not be reverted in the coming years; on the contrary, it is expected that natural phenomena will increase in number and intensity due to the global warming. Because of this situation it is of great interest to have sufficient data related to natural disasters, since these data are absolutely necessary to analyze their impact as well as to establish links between their occurrence and their effects. In accordance to this necessity, in this paper we describe a system based on Machine Learning methods that improves the acquisition of natural disaster data. This system automatically populates a natural disaster database by extracting information from online news reports. In particular, it allows extracting information about five different types of natural disasters: hurricanes, earthquakes, forest fires, inundations, and droughts. Experimental results on a collection of Spanish news show the effectiveness of the proposed system for detecting relevant documents about natural disasters (reaching an F-measure of 98%), as well as for extracting relevant facts to be inserted into a given database (reaching an F-measure of 76%).]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[Los desastres causados por fenómenos naturales han estado presentes desde el principio de la historia del hombre; sin embargo, sus consecuencias son cada vez mayores. Esta tendencia podría no ser revertida en los próximos años; al contrario, se espera que los fenómenos naturales puedan incrementar en número e intensidad debido al calentamiento global. A causa de esta situación es de gran interés tener suficientes datos relacionados a los desastres naturales, ya que estos datos son absolutamente necesarios para analizar su impacto así como para establecer conexiones entre su ocurrencia y sus efectos. En correspondencia con esta necesidad, en este artículo describimos un sistema basado en métodos de Aprendizaje Automático que mejora la adquisición de datos de desastres naturales. Este sistema automáticamente llena una base de datos de desastres naturales con la información extraída de noticias de periódicos en línea. En particular, este sistema permite extraer información acerca de cinco tipos de desastres naturales: huracanes, temblores, incendios forestales, inundaciones y sequías. Los resultados experimentales en una colección de noticias en Español muestran la eficacia del sistema propuesto tanto para detectar documentos relevantes sobre desastres naturales (alcanzando una medida-F de 98%), así como para extraer hechos relevantes para ser insertados en una base de datos dada (alcanzando una medida-F de 76%).]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Machine Learning]]></kwd>
<kwd lng="en"><![CDATA[Information Extraction]]></kwd>
<kwd lng="en"><![CDATA[Text Categorization]]></kwd>
<kwd lng="en"><![CDATA[Natural Disasters]]></kwd>
<kwd lng="en"><![CDATA[Databases]]></kwd>
<kwd lng="es"><![CDATA[Aprendizaje Automático]]></kwd>
<kwd lng="es"><![CDATA[Extracción de Información]]></kwd>
<kwd lng="es"><![CDATA[Clasificación Temática de Textos]]></kwd>
<kwd lng="es"><![CDATA[Desastres Naturales]]></kwd>
<kwd lng="es"><![CDATA[Bases de Datos]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>Using Machine Learning for Extracting Information from Natural Disaster News Reports</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="3"><b><i>Usando Aprendizaje Autom&aacute;tico para Extraer Informaci&oacute;n de Noticias de Desastres Naturales</i></b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>Alberto T&eacute;llez Valero, Manuel Montes y G&oacute;mez and Luis Villase&ntilde;or Pineda</b></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i>Laboratorio de Tecnolog&iacute;as del Lenguaje, Coordinaci&oacute;n de Ciencias Computacionales, Instituto Nacional de Astrof&iacute;sica, &Oacute;ptica y Electr&oacute;nica (INAOE). Luis Enrique Erro #1, Tonantzintla, Puebla, M&eacute;xico; <a href="mailto:albertotellezv@ccc.inaoep.mx">albertotellezv@ccc.inaoep.mx</a> , <a href="mailto:mmontesg@ccc.inaoep.mx">mmontesg@ccc.inaoep.mx</a> , <a href="mailto:villasen@ccc.inaoep.mx">villasen@ccc.inaoep.mx</a></i></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">Article received on July 17, 2008    <br> Accepted on April 03, 2009</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">The disasters caused by natural phenomena have been present all along human history; nevertheless, their consequences are greater each time. This tendency will not be reverted in the coming years; on the contrary, it is expected that natural phenomena will increase in number and intensity due to the global warming. Because of this situation it is of great interest to have sufficient data related to natural disasters, since these data are absolutely necessary to analyze their impact as well as to establish links between their occurrence and their effects. In accordance to this necessity, in this paper we describe a system based on Machine Learning methods that improves the acquisition of natural disaster data. This system automatically populates a natural disaster database by extracting information from online news reports. In particular, it allows extracting information about five different types of natural disasters: hurricanes, earthquakes, forest fires, inundations, and droughts. Experimental results on a collection of Spanish news show the effectiveness of the proposed system for detecting relevant documents about natural disasters (reaching an F&#150;measure of 98%), as well as for extracting relevant facts to be inserted into a given database (reaching an F&#150;measure of 76%).</font></p>     <p align="justify"><font face="verdana" size="2"><b>Keywords: </b>Machine Learning, Information Extraction, Text Categorization, Natural Disasters, Databases.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Resumen.</b></font></p>     <p align="justify"><font face="verdana" size="2">Los desastres causados por fen&oacute;menos naturales han estado presentes desde el principio de la historia del hombre; sin embargo, sus consecuencias son cada vez mayores. Esta tendencia podr&iacute;a no ser revertida en los pr&oacute;ximos a&ntilde;os; al contrario, se espera que los fen&oacute;menos naturales puedan incrementar en n&uacute;mero e intensidad debido al calentamiento global. A causa de esta situaci&oacute;n es de gran inter&eacute;s tener suficientes datos relacionados a los desastres naturales, ya que estos datos son absolutamente necesarios para analizar su impacto as&iacute; como para establecer conexiones entre su ocurrencia y sus efectos. En correspondencia con esta necesidad, en este art&iacute;culo describimos un sistema basado en m&eacute;todos de Aprendizaje Autom&aacute;tico que mejora la adquisici&oacute;n de datos de desastres naturales. Este sistema autom&aacute;ticamente llena una base de datos de desastres naturales con la informaci&oacute;n extra&iacute;da de noticias de peri&oacute;dicos en l&iacute;nea. En particular, este sistema permite extraer informaci&oacute;n acerca de cinco tipos de desastres naturales: huracanes, temblores, incendios forestales, inundaciones y sequ&iacute;as. Los resultados experimentales en una colecci&oacute;n de noticias en Espa&ntilde;ol muestran la eficacia del sistema propuesto tanto para detectar documentos relevantes sobre desastres naturales (alcanzando una medida&#150;F de 98%), as&iacute; como para extraer hechos relevantes para ser insertados en una base de datos dada (alcanzando una medida&#150;F de 76%).</font></p>     <p align="justify"><font face="verdana" size="2"><b>Palabras claves: </b>Aprendizaje Autom&aacute;tico, Extracci&oacute;n de Informaci&oacute;n, Clasificaci&oacute;n Tem&aacute;tica de Textos, Desastres Naturales, Bases de Datos.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v13n1/v13n1a4.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Acknowledgments</b></font></p>     <p align="justify"><font face="verdana" size="2">This work was partially supported by Conacyt through research grants (CB&#150;61335, CB&#150;82050 and CB&#150;83459) and scholarship (171610).</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">1. <b>Bouckaert, R. </b>(2002). "Low level information extraction". In <i>Proceedings of the workshop on Text Learning </i>(TextML&#150;2002), Sydney, Australia.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047737&pid=S1405-5546200900030000400001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">2. <b>Cowie, J. and Lehnert, W. </b>(1998). "Information Extraction". <i>Communications of the ACM, </i>Vol. 39, No. 1, pp. 80&#150;91</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047738&pid=S1405-5546200900030000400002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">3. <b>Freitag, D.  </b>(1998). "Machine Learning for Information Extraction in Informal Domains". <i>Ph.d.  thesis, </i>Computer Science Department, Carnegie Mellon University.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047739&pid=S1405-5546200900030000400003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">4. <b>Hobbs, J. R. </b>(1992). "The Generic Information Extraction System". In B. Sundheim, editor. <i>Fourth Message Understanding Conference (MUC&#150;4), </i>Mc Lean, Virginia, June. Distributed by Morgan Kauffman Publishers, Inc., San Mateo, California.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047740&pid=S1405-5546200900030000400004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">5. <b>Ireson, N., Ciravega, F., Califf, M. E., Freitag, D., Kushmerick, N., and Labelli, A. </b>(2005). "Evaluating Machine Learning for Information Extraction", In <i>Proceedings of the 22<sup>nd</sup> International Conference on Machine Learning, </i>Bonn, Germany.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047741&pid=S1405-5546200900030000400005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">6. <b>Jackson, P. &amp; Moulinier, I. </b>(2007). "Natural Language Processing for Online applications: text retrieval, extraction and categorization". John Benjamins Publishing Co, second edition, June.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047742&pid=S1405-5546200900030000400006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">7. <b>Joachins, T. </b>(2002). "Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms". Kluwer Academic Publishers, May.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047743&pid=S1405-5546200900030000400007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">8. <b>Kushmerick, N., Johnston, E., and McGuinness, S. </b>(2001). "Information Extraction by Text Classification". <i>Seventeenth International Join Conference on Artificial Intelligence (IJCAI&#150;2001), </i>N. Kushmerick Ed. Adaptive Text Extraction and Mining (Working Notes), Seattle, Washington , pp. 44&#150;50.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047744&pid=S1405-5546200900030000400008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">9. <b>Li, Y., Bontcheva, K., and Cunningham, H.  </b>(2005). "SVM Based Learning System for Information Extraction". In <i>Proceedings of Sheffield Machine Learning Workshop, </i>Lecture Notes in Computer Science. Springer Verlag.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047745&pid=S1405-5546200900030000400009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">10. <b>Mitchell, T. </b>(1997). "Machine Learning". McGraw Hill.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047746&pid=S1405-5546200900030000400010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">11. <b>Moens M. </b>(2006). "Information Extraction: Algorithms and Prospects in a Retrieval Context".  Springer (Information retrieval series, edited by W. Bruce Croft), October.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047747&pid=S1405-5546200900030000400011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">12. <b>Muslea, I. </b>(1999). "Extraction Patterns for Information Extractions Tasks: A Survey". In <i>Proceedings of the AAAI Workshop on Machine Learning for Information Extraction, </i>July, Orlando, Florida.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047748&pid=S1405-5546200900030000400012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">13. <b>Peng, F. </b>(1999). "Models Development in IE Tasks &#150; A survey". CS685 (Intelligent Computer Interface) course project, Computer Science Department, University of Waterloo.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047749&pid=S1405-5546200900030000400013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">14. <b>Riloff, E.</b> (1996). "Automatically Generating Extraction Patterns from untagged text". In <i>Proceedings of the 13th National Conference on Artificial Intelligence (AAAI), </i>pp. 1044&#150;1049.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047750&pid=S1405-5546200900030000400014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">15. <b>Riloff, E.  &amp; Jeffrey L.  </b>(1999). "Extraction&#150;based text categorization:  Generating domain&#150;specific role relationships automatically". In Tomek Strzalkowski (Ed.), <i>Natural Language Information Retrieval </i>(pp. 167&#150;196). Dordrecht, The Netherlands: Kluwer Academic Publishers.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047751&pid=S1405-5546200900030000400015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">16. <b>Roth, D. &amp; Yih, W. </b>(2001). "Relational Learning Via Propositional Algorithms: An Information Extraction Case Study". In <i>Proceedings of the 15th International Conference on Artificial Intelligence (IJCA&#150;01I), </i>Morgan Kauffman Publisher, Inc., San Francisco, California, pp. 1257&#150;1263.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047752&pid=S1405-5546200900030000400016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">17. <b>Salzberg, S. L. </b>(1999). "On Comparing Classifiers: A Critique of Current Research and Methods". <i>Data Mining and Knowledge Discovery, </i>1:1&#150;12.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047753&pid=S1405-5546200900030000400017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">18. <b>Scheffer T., Decomain C., &amp; Wrobel S. </b>(2001). "Active hidden Markov models for information extraction". <i>Lecture Notes in Computer Science, </i>Vol. 2189, Springer, pp. 309&#150;318.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047754&pid=S1405-5546200900030000400018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">19. <b>Sebastiani, F. (2002). </b>"Machine Learning in Automated Text Categorization". <i>ACM Computing Surveys. </i>34(1): 1&#150;47.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047755&pid=S1405-5546200900030000400019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">20. <b>Seymore, K., McCallum, A., &amp; Rosenfeld, R. </b>(1999). "Learning Hidden Markov Model structure for Information Extraction". In <i>Proceedings of the 20th National Conference on Artificial Intelligence (AAAI), </i>pp. 37&#150;42.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047756&pid=S1405-5546200900030000400020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">21. <b>Sonderland, S., Fisher, D., Aseltine, J., &amp; Lehnert, W. </b>(1995). "CRYSTAL: Inducing a Conceptual Dictionary". In <i>Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI), </i>pp. 1314&#150;1321.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047757&pid=S1405-5546200900030000400021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">22. <b>Sonderland, S. </b>(1999). "Learning Information Extraction Rules for Semi&#150;Structured and Free Text". <i>Machine Learning, </i>No. 34, pp. 233&#150;272.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047758&pid=S1405-5546200900030000400022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">23. <b>Stevenson  M.  &amp;  Greenwood  M. A.  </b>(2006). "Comparing Information Extraction Pattern Models",  In <i>Proceedings of the Workshop on Information Extraction Beyond The Document, </i>Association for Computational Linguistics, Sydney, pp. 12&#150;19.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047759&pid=S1405-5546200900030000400023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">24. <b>Turno, J.  </b>(2003). "Information Extraction, Multilinguality and Portability". <i>Revista Iberoamericana de Inteligencia Artificial, </i>No. 22, pp. 57&#150;78.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047760&pid=S1405-5546200900030000400024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">25. <b>Zavrel, J., Berck, P., &amp; Lavrijssen, W. </b>(2000). "Information Extraction by Text Classification: Corpus Mining for Features". In <i>Proceedings of the workshop Information Extraction meets Corpus Linguistics, </i>Athens, Greece.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2047761&pid=S1405-5546200900030000400025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bouckaert]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Low level information extraction"]]></article-title>
<source><![CDATA[]]></source>
<year>2002</year>
<conf-name><![CDATA[ Proceedings of the workshop on Text Learning]]></conf-name>
<conf-date>2002</conf-date>
<conf-loc>Sydney </conf-loc>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cowie]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Lehnert]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Information Extraction"]]></article-title>
<source><![CDATA[Communications of the ACM]]></source>
<year>1998</year>
<volume>39</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>80-91</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Freitag]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA["Machine Learning for Information Extraction in Informal Domains"]]></source>
<year>1998</year>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hobbs]]></surname>
<given-names><![CDATA[J. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["The Generic Information Extraction System"]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Sundheim]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[]]></source>
<year>1992</year>
<conf-name><![CDATA[ Fourth Message Understanding Conference (MUC-4)]]></conf-name>
<conf-loc>Mc Lean Virginia</conf-loc>
<publisher-loc><![CDATA[an Mateo^eCalifornia California]]></publisher-loc>
<publisher-name><![CDATA[Morgan Kauffman Publishers, Inc.]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ireson]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Ciravega]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Califf]]></surname>
<given-names><![CDATA[M. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Freitag]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kushmerick]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Labelli]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Evaluating Machine Learning for Information Extraction"]]></article-title>
<source><![CDATA[]]></source>
<year></year>
<conf-name><![CDATA[22nd International Conference on Machine Learning]]></conf-name>
<conf-loc>Bonn </conf-loc>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jackson]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Moulinier]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<source><![CDATA["Natural Language Processing for Online applications: text retrieval, extraction and categorization"]]></source>
<year>2007</year>
<edition>second</edition>
<publisher-name><![CDATA[John Benjamins Publishing Co]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Joachins]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA["Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms"]]></source>
<year>2002</year>
<publisher-name><![CDATA[Kluwer Academic Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kushmerick]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Johnston]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[McGuinness]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Information Extraction by Text Classification"]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Kushmerick]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[]]></source>
<year>2001</year>
<conf-name><![CDATA[Seventeenth International Join Conference on Artificial Intelligence]]></conf-name>
<conf-date>2001</conf-date>
<conf-loc> </conf-loc>
<page-range>44-50</page-range><publisher-loc><![CDATA[Seattle^eWashington Washington]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Bontcheva]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Cunningham]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["SVM Based Learning System for Information Extraction"]]></article-title>
<source><![CDATA[]]></source>
<year>2005</year>
<conf-name><![CDATA[ Proceedings of Sheffield Machine Learning Workshop]]></conf-name>
<conf-loc> </conf-loc>
<publisher-name><![CDATA[Springer Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mitchell]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA["Machine Learning"]]></source>
<year>1997</year>
<publisher-name><![CDATA[McGraw Hill]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moens]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA["Information Extraction: Algorithms and Prospects in a Retrieval Context"]]></source>
<year>2006</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Muslea]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Extraction Patterns for Information Extractions Tasks: A Survey"]]></article-title>
<source><![CDATA[]]></source>
<year>1999</year>
<conf-name><![CDATA[ Proceedings of the AAAI Workshop on Machine Learning for Information Extraction]]></conf-name>
<conf-loc> </conf-loc>
<publisher-loc><![CDATA[Orlando^eFlorida Florida]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Peng]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<source><![CDATA["Models Development in IE Tasks - A survey"]]></source>
<year>1999</year>
<volume>CS685</volume>
<publisher-name><![CDATA[Computer Science Department, University of Waterloo]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Riloff]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Automatically Generating Extraction Patterns from untagged text"]]></article-title>
<source><![CDATA[]]></source>
<year>1996</year>
<conf-name><![CDATA[13th National Conference on Artificial Intelligence]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1044-1049</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Riloff]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Jeffrey]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Extraction-based text categorization: Generating domain-specific role relationships automatically"]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Strzalkowski]]></surname>
<given-names><![CDATA[Tomek]]></given-names>
</name>
</person-group>
<source><![CDATA[Natural Language Information Retrieval]]></source>
<year>1999</year>
<page-range>167-196</page-range><publisher-loc><![CDATA[Dordrecht ]]></publisher-loc>
<publisher-name><![CDATA[Kluwer Academic Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Roth]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Yih]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Relational Learning Via Propositional Algorithms: An Information Extraction Case Study"]]></article-title>
<source><![CDATA[]]></source>
<year>2001</year>
<conf-name><![CDATA[15th International Conference on Artificial Intelligence (IJCA-01I)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1257-1263</page-range><publisher-loc><![CDATA[San Francisco^eCalifornia California]]></publisher-loc>
<publisher-name><![CDATA[Morgan Kauffman Publisher, Inc.]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salzberg]]></surname>
<given-names><![CDATA[S. L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["On Comparing Classifiers: A Critique of Current Research and Methods"]]></article-title>
<source><![CDATA[Data Mining and Knowledge Discovery]]></source>
<year>1999</year>
<volume>1</volume>
<page-range>1-12</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Scheffer]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Decomain]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wrobel]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA["Active hidden Markov models for information extraction"]]></source>
<year>2001</year>
<volume>2189</volume>
<page-range>309-318</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sebastiani]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Machine Learning in Automated Text Categorization"]]></article-title>
<source><![CDATA[ACM Computing Surveys]]></source>
<year>2002</year>
<volume>34</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-47</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Seymore]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[McCallum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosenfeld]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Learning Hidden Markov Model structure for Information Extraction"]]></article-title>
<source><![CDATA[]]></source>
<year>1999</year>
<conf-name><![CDATA[20th National Conference on Artificial Intelligence]]></conf-name>
<conf-loc> </conf-loc>
<page-range>37-42</page-range></nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sonderland]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Fisher]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Aseltine]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Lehnert]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["CRYSTAL: Inducing a Conceptual Dictionary"]]></article-title>
<source><![CDATA[]]></source>
<year>1995</year>
<conf-name><![CDATA[14th International Joint Conference on Artificial Intelligence (IJCAI)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1314-1321</page-range></nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sonderland]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Learning Information Extraction Rules for Semi-Structured and Free Text"]]></article-title>
<source><![CDATA[Machine Learning]]></source>
<year>1999</year>
<numero>34</numero>
<issue>34</issue>
<page-range>233-272</page-range></nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Stevenson]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Greenwood]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Comparing Information Extraction Pattern Models"]]></article-title>
<source><![CDATA[]]></source>
<year>2006</year>
<conf-name><![CDATA[ Proceedings of the Workshop on Information Extraction Beyond The Document]]></conf-name>
<conf-loc>Sydney </conf-loc>
<page-range>12-19</page-range></nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Turno]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Information Extraction, Multilinguality and Portability"]]></article-title>
<source><![CDATA[Revista Iberoamericana de Inteligencia Artificial]]></source>
<year>2003</year>
<numero>22</numero>
<issue>22</issue>
<page-range>57-78</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zavrel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Berck]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Lavrijssen]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA["Information Extraction by Text Classification: Corpus Mining for Features"]]></article-title>
<source><![CDATA[]]></source>
<year>2000</year>
<conf-name><![CDATA[ Proceedings of the workshop Information Extraction meets Corpus Linguistics]]></conf-name>
<conf-loc>Athens </conf-loc>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
