Services on Demand
Journal
Article
Indicators
- Cited by SciELO
- Access statistics
Related links
- Similars in SciELO
Share
Investigación bibliotecológica
On-line version ISSN 2448-8321Print version ISSN 0187-358X
Abstract
VILCHES-BLAZQUEZ, Luis M. and COMESANA OCAMPO, Diana. Characterization of urban risks in the press applying text mining for the enrichment of open data. Investig. bibl [online]. 2022, vol.36, n.91, pp.85-107. Epub Nov 15, 2022. ISSN 2448-8321. https://doi.org/10.22201/iibi.24488321xe.2022.91.58538.
News is freely spread and widely available to Internet users much more easily than traditional media. In the news, we can find an infinite number of hidden “minor data,” that can provide valuable information not collected in other sources of information. In this context, we have been interested in analyzing and characterizing the urban risks contained in the Uruguayan open newspapers using text mining techniques. This proposal makes it possible to create a news corpus based on risk events included in open data. The corpus covers 2003-2019 and is built from the digital open newspapers El Eco Digital, Montevideo Portal, and La Red 21. Various text mining techniques are applied to this corpus using the QDA-MinerLite software and the Python language (concretely, through the Scattertext library) to identify, characterize, and discover insights on these events. The corpus processing results help enrich the existing open data on risks in Uruguay, incorporating information on their effects, actors, and associated interventions.
Keywords : Urban Risk; Text Mining; Open Digital Newspapers; Open Data.