SciELO - Scientific Electronic Library Online

 

Computación y Sistemas

Online version ISSN 2007-9737; print version ISSN 1405-5546

Abstract

GUTIERREZ-HINOJOSA, Sandra J.; CALVO, Hiram and MORENO-ARMENDARIZ, Marco A. Central Embeddings for Extractive Summarization Based on Similarity. Comp. y Sist. [online]. 2019, vol.23, n.3, pp.649-663. Epub 09-Aug-2021. ISSN 2007-9737. https://doi.org/10.13053/cys-23-3-3256.

In this work, we propose combining word embeddings with unsupervised methods such as clustering for the multi-document summarization task of DUC (Document Understanding Conference) 2002. We aim to find evidence that word embeddings retain semantic information and that these representations can be grouped by similarity, so that the main ideas in a set of documents can be identified. We experiment with different clustering methods to extract candidates for the multi-document summarization task. Our experiments show that the method is able to find the prevalent ideas. The ROUGE scores we obtain are similar to the state of the art, even though not all of the main ideas are found. Because the method requires no annotated resources, it provides a domain- and language-independent way to create a summary.
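The general idea of grouping embeddings by similarity and extracting one candidate per group can be illustrated with a minimal sketch. The abstract does not say how sentences are represented or which clustering methods were used, so everything below is an assumption made for illustration: sentences are represented by the average of their word vectors, the clustering is a plain k-means with cosine similarity, and the sentence closest to each cluster centre is taken as the candidate. The names `TOY_EMBEDDINGS`, `sentence_vector`, `kmeans` and `summarize` are hypothetical, and the 2-D vectors are made up; a real system would load pretrained embeddings.

```python
import math

# Hypothetical 2-D toy embeddings (made up for illustration only).
TOY_EMBEDDINGS = {
    "storm": (1.0, 0.1), "rain": (0.9, 0.2), "flood": (0.95, 0.0),
    "election": (0.1, 1.0), "vote": (0.0, 0.9), "ballot": (0.2, 0.95),
}

def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return tuple(sum(v[i] for v in vectors) / len(vectors) for i in range(dim))

def cosine(a, b):
    """Cosine similarity; 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def sentence_vector(sentence, embeddings):
    """Average the vectors of known words; None if no word is known."""
    vecs = [embeddings[w] for w in sentence.lower().split() if w in embeddings]
    return mean(vecs) if vecs else None

def assign(vectors, centroids):
    """Label each vector with the index of its most similar centroid."""
    return [max(range(len(centroids)), key=lambda j: cosine(v, centroids[j]))
            for v in vectors]

def kmeans(vectors, k, iterations=20):
    """Plain k-means on cosine similarity with a deterministic start:
    the first vector, then repeatedly the vector least similar to the
    centroids chosen so far (assumed initialisation, not from the paper)."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            min(vectors, key=lambda v: max(cosine(v, c) for c in centroids)))
    for _ in range(iterations):
        labels = assign(vectors, centroids)
        updated = []
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            # Keep the old centroid if a cluster ends up empty.
            updated.append(mean(members) if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(vectors, centroids)

def summarize(sentences, embeddings, k):
    """Return up to k sentences, one per cluster, in document order."""
    indexed = [(i, sentence_vector(s, embeddings))
               for i, s in enumerate(sentences)]
    indexed = [(i, v) for i, v in indexed if v is not None]
    vectors = [v for _, v in indexed]
    k = min(k, len(vectors))
    if k == 0:
        return []
    centroids, labels = kmeans(vectors, k)
    chosen = set()
    for j, centroid in enumerate(centroids):
        members = [(i, v) for (i, v), lab in zip(indexed, labels) if lab == j]
        if members:
            # The "central" sentence is the one most similar to the centroid.
            chosen.add(max(members, key=lambda m: cosine(m[1], centroid))[0])
    return [sentences[i] for i in sorted(chosen)]

if __name__ == "__main__":
    docs = [
        "storm rain today", "flood storm rain", "rain",
        "election vote", "vote ballot election", "ballot",
    ]
    for line in summarize(docs, TOY_EMBEDDINGS, k=2):
        print(line)
```

With two clusters, the sketch picks one sentence from the weather group and one from the election group, which mirrors the abstract's notion of one extracted candidate per prevalent idea. Sentences with no known words are skipped rather than assigned a zero vector; that is a design choice of this sketch, not something stated in the source.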

Keywords: Extractive summarization; prevalent ideas extraction; concept similarity; central embeddings; DUC 2002.
