Computación y Sistemas

Print version ISSN 1405-5546

Abstract

DA CUNHA, Iria; VIVALDI, Jorge; TORRES-MORENO, Juan-Manuel and SIERRA, Gerardo. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics. Comp. y Sist. [online]. 2014, vol.18, n.3, pp.505-516. ISSN 1405-5546. http://dx.doi.org/10.13053/CyS-18-3-2033.

Automatic systems for detecting and measuring textual similarity are currently being developed for application to different tasks in the field of Natural Language Processing (NLP). These systems rely on surface linguistic features or statistical information; few researchers use deep linguistic information. In this work, we present an algorithm for detecting and measuring textual similarity that takes into account the information offered by the discourse relations of Rhetorical Structure Theory (RST) and the lexical-semantic relations included in EuroWordNet. We apply the algorithm, called SIMTEX, to texts written in Spanish, but the methodology is potentially language-independent.

Keywords: Textual similarity; discourse; semantics; paraphrase.


All the contents of this journal, except where otherwise noted, are licensed under a Creative Commons Attribution License.