SciELO - Scientific Electronic Library Online

Computación y Sistemas

Print version ISSN 1405-5546

Abstract

LITVAK, Marina  and  VANETIK, Natalia. Multi-document Summarization using Tensor Decomposition. Comp. y Sist. [online]. 2014, vol.18, n.3, pp.581-589. ISSN 1405-5546.  http://dx.doi.org/10.13053/CyS-18-3-2026.

The problem of extractive text summarization for a collection of documents is defined as selecting a small subset of sentences so that the content and meaning of the original document set are preserved as well as possible. In this paper we present a new model for extractive summarization that aims to produce a summary preserving as much of the information coverage of the original document set as possible. We construct a new tensor-based representation that describes the given document set in terms of its topics. We then rank the topics via tensor decomposition and compile a summary from the sentences of the highest-ranked topics.
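The pipeline outlined in the abstract (build a tensor, decompose it, rank topics, pick sentences) might be sketched roughly as follows. This is an illustrative assumption, not the authors' construction: the abstract does not specify the tensor layout or the decomposition, so the sketch uses a hypothetical sentence × term × document count tensor and substitutes an SVD of its mode-1 unfolding (one step of a higher-order SVD) for the decomposition. The function names `build_tensor` and `summarize` are invented for this example.

```python
import numpy as np


def build_tensor(docs):
    """Build a hypothetical sentence x term x document count tensor.

    docs is a list of documents, each a list of sentence strings.
    tensor[s, t, d] holds the count of term t in sentence s, stored in
    the slice of the document d that the sentence came from.
    """
    sentences = [(d, s) for d, doc in enumerate(docs) for s in doc]
    vocab = sorted({w for _, s in sentences for w in s.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    tensor = np.zeros((len(sentences), len(vocab), len(docs)))
    for row, (d, s) in enumerate(sentences):
        for w in s.lower().split():
            tensor[row, index[w], d] += 1.0
    return tensor, [s for _, s in sentences]


def summarize(docs, n_sentences=2):
    """Pick at most n_sentences sentences, one per top-ranked 'topic'."""
    tensor, sentences = build_tensor(docs)
    # Mode-1 unfolding: one row per sentence, one column per (term, document) pair.
    unfolded = tensor.reshape(tensor.shape[0], -1)
    # Assumption: left singular vectors stand in for topics, and the
    # singular values (returned largest first) stand in for their ranking.
    u, sing, _ = np.linalg.svd(unfolded, full_matrices=False)
    chosen = []
    for topic in range(len(sing)):
        # Sentence with the strongest loading on this topic.
        best = int(np.argmax(np.abs(u[:, topic])))
        if best not in chosen:
            chosen.append(best)
        if len(chosen) == n_sentences:
            break
    # Return the selected sentences in their original order.
    return [sentences[i] for i in sorted(chosen)]


docs = [
    ["cats chase mice at night", "stocks fell sharply today"],
    ["cats chase birds at dawn", "stocks rose sharply today"],
]
summary = summarize(docs, n_sentences=2)
```

Because the document mode here only records which document a sentence came from, sentences from different documents do not share columns in the unfolding. A representation that actually captures cross-document topics, as the paper describes, would need a different construction.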

Keywords: Tensor decomposition; multilingual multi-document summarization.


Creative Commons License: All the contents of this journal, except where otherwise noted, are licensed under a Creative Commons Attribution License.