SciELO - Scientific Electronic Library Online

 
vol.13 issue1Probabilistic Intelligent Systems for Thermal Power PlantsIncompressibility and Lossless Data Compression: An Approach by Pattern Discovery author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

TELLEZ VALERO, Alberto; MONTES Y GOMEZ, Manuel  and  VILLASENOR PINEDA, Luis. Using Machine Learning for Extracting Information from Natural Disaster News Reports. Comp. y Sist. [online]. 2009, vol.13, n.1, pp.33-44. ISSN 2007-9737.

The disasters caused by natural phenomena have been present all along human history; nevertheless, their consequences are greater each time. This tendency will not be reverted in the coming years; on the contrary, it is expected that natural phenomena will increase in number and intensity due to the global warming. Because of this situation it is of great interest to have sufficient data related to natural disasters, since these data are absolutely necessary to analyze their impact as well as to establish links between their occurrence and their effects. In accordance to this necessity, in this paper we describe a system based on Machine Learning methods that improves the acquisition of natural disaster data. This system automatically populates a natural disaster database by extracting information from online news reports. In particular, it allows extracting information about five different types of natural disasters: hurricanes, earthquakes, forest fires, inundations, and droughts. Experimental results on a collection of Spanish news show the effectiveness of the proposed system for detecting relevant documents about natural disasters (reaching an F-measure of 98%), as well as for extracting relevant facts to be inserted into a given database (reaching an F-measure of 76%).

Keywords : Machine Learning; Information Extraction; Text Categorization; Natural Disasters; Databases.

        · abstract in Spanish     · text in English     · English ( pdf )

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License