SciELO - Scientific Electronic Library Online



Computación y Sistemas

Print version ISSN 1405-5546


BATTISTELLI, Delphine; CHARNOIS, Thierry; MINEL, Jean-Luc and TEISSEDRE, Charles. Detecting Salient Events in Large Corpora by a Combination of NLP and Data Mining Techniques. Comp. y Sist. [online]. 2013, vol.17, n.2, pp.229-237. ISSN 1405-5546.

In this paper, we present a framework and a system that extract "salient" events relevant to a query from a large collection of documents and place those events along a timeline. Each event is represented by a sentence extracted from the collection. Our experiments indicate that the method is useful for this task. The method combines linguistic modeling (of the meanings of temporal adverbials), symbolic natural language processing techniques (cascades of morpho-lexical transducers), and data mining techniques (namely, sequential pattern mining under constraints). The system was applied to a corpus of French newswires provided by the Agence France Presse (AFP). The evaluation was performed in partnership with journalists from the French newswire agency.

Keywords: Dates; temporal adverbials; event extraction; sequential pattern.
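
As an illustration of what "sequential pattern mining under constraints" can mean in practice, the following is a minimal sketch, not the authors' implementation. It assumes each sentence has already been reduced to a sequence of labels, and it uses two constraints chosen for illustration: a minimum support (number of sequences containing the pattern) and a maximum gap between consecutive pattern items. The function names, the labels, and the toy data are all hypothetical.

```python
from collections import Counter


def occurs(pattern, sequence, max_gap):
    """Return True if `pattern` is embedded in `sequence` with at most
    `max_gap` skipped items between consecutive pattern items."""
    # Positions where the first pattern item can be matched.
    ends = {i for i, tok in enumerate(sequence) if tok == pattern[0]}
    for item in pattern[1:]:
        # Keep every position that extends some earlier partial match
        # without exceeding the gap constraint.
        ends = {
            j
            for j, tok in enumerate(sequence)
            if tok == item and any(e < j <= e + max_gap + 1 for e in ends)
        }
        if not ends:
            return False
    return bool(ends)


def support(pattern, sequences, max_gap):
    """Count the sequences that contain `pattern` under the gap constraint."""
    return sum(occurs(pattern, s, max_gap) for s in sequences)


def mine(sequences, min_support, max_gap, max_len):
    """Level-wise mining: grow frequent patterns one item at a time.
    Any prefix of a frequent pattern is itself frequent, so extending
    only frequent patterns does not miss any result."""
    item_counts = Counter(tok for s in sequences for tok in set(s))
    items = sorted(t for t, c in item_counts.items() if c >= min_support)
    frequent = {(t,): item_counts[t] for t in items}
    level = list(frequent)
    for _ in range(max_len - 1):
        next_level = []
        for prefix in level:
            for t in items:
                candidate = prefix + (t,)
                sup = support(candidate, sequences, max_gap)
                if sup >= min_support:
                    frequent[candidate] = sup
                    next_level.append(candidate)
        level = next_level
    return frequent


# Hypothetical toy input: each sentence reduced to a sequence of labels.
sequences = [
    ["DATE", "PERSON", "VERB", "PLACE"],
    ["DATE", "VERB", "PLACE"],
    ["PERSON", "VERB", "DATE"],
]
patterns = mine(sequences, min_support=2, max_gap=1, max_len=3)
```

Here `patterns` maps each frequent pattern (a tuple of labels) to its support. Tightening `max_gap` or raising `min_support` prunes the result set, which is the general role constraints play in this kind of mining.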



All the contents of this journal, except where otherwise noted, are licensed under a Creative Commons Attribution License.