SciELO - Scientific Electronic Library Online

vol.17 issue2Single-Document Keyphrase Extraction for Multi-Document Keyphrase ExtractionExtracting Phrases Describing Problems with Products and Services from Twitter Messages author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand




Related links

  • Have no similar articlesSimilars in SciELO


Computación y Sistemas

Print version ISSN 1405-5546


JEAN-LOUIS, Ludovic; GAGNON, Michel  and  CHARTON, Eric. A Knowledge-Base Oriented Approach for Automatic Keyword Extraction. Comp. y Sist. [online]. 2013, vol.17, n.2, pp.187-196. ISSN 1405-5546.

Automatic keyword extraction is an important subfield of information extraction process. It is a difficult task, where numerous different techniques and resources have been proposed. In this paper, we propose a generic approach to extract keyword from documents using encyclopedic knowledge. Our two-step approach first relies on a classification step for identifying candidate keywords followed by a learning-to-rank method depending on a user-defined keyword profile to order the candidates. The novelty of our approach relies on i) the usage of the keyword profile ii) generic features derived from Wikipedia categories and not necessarily related to the document content. We evaluate our system on keyword datasets and corpora from standard evaluation campaign and show that our system improves the global process of keyword extraction.

Keywords : Automatic keyword extraction; encyclopedic knowledge.

        · abstract in Spanish     · text in English     · English ( pdf )


Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License