<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462016000400647</article-id>
<article-id pub-id-type="doi">10.13053/cys-20-4-2506</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Semantic Textual Similarity Methods, Tools, and Applications: A Survey]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Majumder]]></surname>
<given-names><![CDATA[Goutam]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pakray]]></surname>
<given-names><![CDATA[Partha]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[Alexander]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[David]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,National Institute of Technology Mizoram  ]]></institution>
<addr-line><![CDATA[Aizawl ]]></addr-line>
<country>India</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Instituto Politecnico Nacional CIC ]]></institution>
<addr-line><![CDATA[Mexico City ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Benemerita Universidad Autonoma de Puebla Faculty of Computer Science ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Mexico</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2016</year>
</pub-date>
<volume>20</volume>
<numero>4</numero>
<fpage>647</fpage>
<lpage>665</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462016000400647&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462016000400647&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462016000400647&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract Measuring Semantic Textual Similarity (STS), between words/ terms, sentences, paragraph and document plays an important role in computer science and computational linguistic. It also has many applications over several fields such as Biomedical Informatics and Geoinformation. In this paper, we present a survey on different methods of textual similarity and we also reported about the availability of different software and tools those are useful for STS. In natural language processing (NLP), STS is a important component for many tasks such as document summarization, word sense disambiguation, short answer grading, information retrieval and extraction. We split out the measures for semantic similarity into three broad categories such as (i) Topological/Knowledge-based (ii) Statistical/ Corpus Based (iii) String based. More emphasis is given to the methods related to the WordNet taxonomy. Because topological methods, plays an important role to understand intended meaning of an ambiguous word, which is very difficult to process computationally. We also propose a new method for measuring semantic similarity between sentences. This proposed method, uses the advantages of taxonomy methods and merge these information to a language model. It considers the WordNet synsets for lexical relationships between nodes/words and a uni-gram language model is implemented over a large corpus to assign the information content value between the two nodes of different classes.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[WordNet taxonomy]]></kwd>
<kwd lng="en"><![CDATA[natural language processing]]></kwd>
<kwd lng="en"><![CDATA[semantic textual similarity]]></kwd>
<kwd lng="en"><![CDATA[information content]]></kwd>
<kwd lng="en"><![CDATA[random walk]]></kwd>
<kwd lng="en"><![CDATA[statistical similarity]]></kwd>
<kwd lng="en"><![CDATA[cosine similarity]]></kwd>
<kwd lng="en"><![CDATA[term-based similarity]]></kwd>
<kwd lng="en"><![CDATA[character-based similarity]]></kwd>
<kwd lng="en"><![CDATA[n-gram]]></kwd>
<kwd lng="en"><![CDATA[Jaccard similarity]]></kwd>
<kwd lng="en"><![CDATA[WordNet similarity]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Banea]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cardie]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cer]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Agirre]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Guo]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Rigau]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiebe]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semeval-2014 task 10: Multilingual semantic textual similarity]]></source>
<year>2014</year>
<conf-name><![CDATA[ 8international workshop on semantic evaluation (SemEval 2014)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>81-91</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Cer]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Agirre]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Guo]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[sem 2013 shared task: Semantic textual similarity, including a pilot on typed-similarity]]></source>
<year>2013</year>
<conf-name><![CDATA[ SEM 2013: The Second Joint Conference on Lexical and Computational Semantics]]></conf-name>
<conf-loc> </conf-loc>
<publisher-name><![CDATA[Citeseer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Cer]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Agirre]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semeval-2012 task 6: A pilot on semantic textual similarity]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation]]></conf-name>
<conf-loc> </conf-loc>
<page-range>385-93</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A new sentence similarity measure and sentence based extractive technique for automatic text summarization]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aliguliyev]]></surname>
<given-names><![CDATA[R. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>2009</year>
<volume>36</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>7764-72</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ando]]></surname>
<given-names><![CDATA[R. K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Latent semantic space: iterative scaling improves precision ofinterdocument similarity measurement]]></source>
<year>2000</year>
<conf-name><![CDATA[ 23annual international ACM SIGIR conference on Research and development in information retrieval]]></conf-name>
<conf-loc> </conf-loc>
<page-range>216-23</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bagrow]]></surname>
<given-names><![CDATA[J. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Ben-Avraham]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[On the Google-fame of scientists and other populations]]></source>
<year>2005</year>
<publisher-name><![CDATA[arXiv]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Geographic knowledge extraction and semantic similarity in openstreetmap]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ballatore]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Bertolotto]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Wilson]]></surname>
<given-names><![CDATA[D. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Knowledge and Information Systems]]></source>
<year>2013</year>
<volume>37</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>61-81</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bard]]></surname>
<given-names><![CDATA[G. V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Spelling-error tolerant, order-independent pass-phrases via the Damerau-Levenshtein string edit distance metric]]></source>
<year>2007</year>
<conf-name><![CDATA[ fifth Australasian symposium on ACSW frontiers-Volume 68]]></conf-name>
<conf-loc> </conf-loc>
<page-range>117-24</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Barron-Cedeno]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Rosso]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Labaka]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Plagiarism detection across distant language pairs]]></source>
<year>2010</year>
<conf-name><![CDATA[ 23International Conference on Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>37-45</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[The google similarity distance]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cilibrasi]]></surname>
<given-names><![CDATA[R. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Vitanyi]]></surname>
<given-names><![CDATA[P. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on knowledge and data engineering]]></source>
<year>2007</year>
<volume>19</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>370-83</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Image retrieval using multiple evidence ranking]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Coelho]]></surname>
<given-names><![CDATA[T. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Calado]]></surname>
<given-names><![CDATA[P. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Souza]]></surname>
<given-names><![CDATA[L. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Ribeiro-Neto]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Muntz]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on knowledge and data engineering]]></source>
<year>2004</year>
<volume>16</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>408-17</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Data integration using similarity joins and a word-based information representation language]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cohen]]></surname>
<given-names><![CDATA[W. W.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM Transactions on Information Systems (TOIS)]]></source>
<year>2000</year>
<volume>18</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>288-321</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Corley]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Measuring the semantic similarity of texts]]></source>
<year>2005</year>
<conf-name><![CDATA[ ACL workshop on empirical modeling of semantic equivalence and entailment]]></conf-name>
<conf-loc> </conf-loc>
<page-range>13-8</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Measures of the amount of ecologic association between species]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dice]]></surname>
<given-names><![CDATA[L. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ecology]]></source>
<year>1945</year>
<volume>26</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>297-302</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Semantic similarity for automatic classification of chemical compounds]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ferreira]]></surname>
<given-names><![CDATA[J. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Couto]]></surname>
<given-names><![CDATA[F. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[PLoS Comput Biol]]></source>
<year>2010</year>
<volume>6</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>e1000937</page-range></nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fouss]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Pirotte]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Renders]]></surname>
<given-names><![CDATA[J. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Saerens]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on knowledge and data engineering]]></source>
<year>2007</year>
<volume>19</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>355-69</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Computing semantic relatedness using wikipedia-based explicit semantic analysis]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gabrilovich]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Markovitch]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[IJcAI]]></source>
<year>2007</year>
<volume>7</volume>
<page-range>1606-11</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A unified approach to automatic indexing and information retrieval]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ginsberg]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Expert: Intelligent Systems and Their Applications]]></source>
<year>1993</year>
<volume>8</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>46-56</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[The piazza peer data management system]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Halevy]]></surname>
<given-names><![CDATA[A. Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Ives]]></surname>
<given-names><![CDATA[Z. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Madhavan]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Mork]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Suciu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Tatarinov]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on knowledge and data engineering]]></source>
<year>2004</year>
<volume>16</volume>
<numero>7</numero>
<issue>7</issue>
<page-range>787-98</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Harispe]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Ranwez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Janaqi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Montmain]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Bioinformatics]]></source>
<year>2014</year>
<volume>30</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>740-2</page-range></nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Information retrieval based on conceptual distance in is-a hierarchies]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ho Lee]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Ho Kim]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Joon Lee]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of documentation]]></source>
<year>1993</year>
<volume>49</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>188-207</page-range></nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hughes]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Ramage]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Lexical semantic relatedness with random graph walks]]></source>
<year>2007</year>
<conf-name><![CDATA[ EMNLP-CoNLL]]></conf-name>
<conf-loc> </conf-loc>
<page-range>581-9</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Semantic text similarity using corpus-based word similarity and string similarity]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Islam]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Inkpen]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM Transactions on Knowledge Discovery from Data (TKDD)]]></source>
<year>2008</year>
<volume>2</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>10</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jaccard]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Etude comparative de la distribution florale dans une portion des Alpes et du Jura]]></source>
<year>1901</year>
<publisher-name><![CDATA[Impr. Corbaz]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[J. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Conrath]]></surname>
<given-names><![CDATA[D. W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semantic similarity based on corpus statistics and lexical taxonomy]]></source>
<year>1997</year>
<publisher-name><![CDATA[arXiv]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B26">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Taxicab geometry]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Krause]]></surname>
<given-names><![CDATA[E. F.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Mathematics Teacher]]></source>
<year>1973</year>
<volume>66</volume>
<numero>8</numero>
<issue>8</issue>
<page-range>695-706</page-range></nlm-citation>
</ref>
<ref id="B27">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kucera]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Francis]]></surname>
<given-names><![CDATA[W. N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Frequency analysis of English usage: Lexicon and grammar]]></source>
<year>1982</year>
<publisher-loc><![CDATA[Boston ]]></publisher-loc>
<publisher-name><![CDATA[Houghton Mifflin]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B28">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A solution to plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Landauer]]></surname>
<given-names><![CDATA[T. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Dumais]]></surname>
<given-names><![CDATA[S. T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Psychological review]]></source>
<year>1997</year>
<volume>104</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>211</page-range></nlm-citation>
</ref>
<ref id="B29">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[An introduction to latent semantic analysis]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Landauer]]></surname>
<given-names><![CDATA[T. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Foltz]]></surname>
<given-names><![CDATA[P. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Laham]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Discourse processes]]></source>
<year>1998</year>
<volume>25</volume>
<page-range>259-84</page-range></nlm-citation>
</ref>
<ref id="B30">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Sentence similarity based on semantic nets and corpus statistics]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[McLean]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Bandar]]></surname>
<given-names><![CDATA[Z. A.]]></given-names>
</name>
<name>
<surname><![CDATA[O'shea]]></surname>
<given-names><![CDATA[J. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Crockett]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on knowledge and data engineering]]></source>
<year>2006</year>
<volume>18</volume>
<numero>8</numero>
<issue>8</issue>
<page-range>1138-50</page-range></nlm-citation>
</ref>
<ref id="B31">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Producing high-dimensional semantic spaces from lexical co-occurrence]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lund]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Burgess]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Behavior Research Methods, Instruments, &amp; Computers]]></source>
<year>1996</year>
<volume>28</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>203-8</page-range></nlm-citation>
</ref>
<ref id="B32">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Building a large annotated corpus of english: The penn treebank]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Marcus]]></surname>
<given-names><![CDATA[M. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Marcinkiewicz]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Santorini]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational Linguistics]]></source>
<year>1993</year>
<volume>19</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>313-30</page-range></nlm-citation>
</ref>
<ref id="B33">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Matveeva]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Farahat]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Royer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Document representation with generalized latent semantic analysis]]></source>
<year>2005</year>
<conf-name><![CDATA[ Conference 0n Research and Development in Information Retrieval (SIGIR 2005)]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B34">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McInnes]]></surname>
<given-names><![CDATA[B. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Pedersen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Pakhomov]]></surname>
<given-names><![CDATA[S. V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Umls-interface and umls-similarity: open source software for measuring paths and semantic similarity]]></source>
<year>2009</year>
<conf-name><![CDATA[ AMIA Annual Symposium Proceedings]]></conf-name>
<conf-date>2009</conf-date>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B35">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Corpus-based and knowledge-based measures of text semantic similarity]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Corley]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Strapparava]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[AAAI]]></source>
<year>2006</year>
<volume>6</volume>
<page-range>775-80</page-range></nlm-citation>
</ref>
<ref id="B36">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Wordnet: a lexical database for english]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Miller]]></surname>
<given-names><![CDATA[G. A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Communications of the ACM]]></source>
<year>1995</year>
<volume>38</volume>
<numero>11</numero>
<issue>11</issue>
<page-range>39-41</page-range></nlm-citation>
</ref>
<ref id="B37">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mohler]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bunescu]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning to grade short answer questions using semantic similarity measures and dependency graph alignments]]></source>
<year>2011</year>
<conf-name><![CDATA[ 49Annual Meeting of the Association forComputational Linguistics : Human Language Technologies-Volume 1]]></conf-name>
<conf-loc> </conf-loc>
<page-range>752-62</page-range></nlm-citation>
</ref>
<ref id="B38">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A general method applicable to the search for similarities in the amino acid sequence of two proteins]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Needleman]]></surname>
<given-names><![CDATA[S. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Wunsch]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of molecular biology]]></source>
<year>1970</year>
<volume>48</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>443-53</page-range></nlm-citation>
</ref>
<ref id="B39">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pedersen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Patwardhan]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Michelizzi]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[WordNet:: Similarity: measuring the relat-edness of concepts]]></source>
<year>2004</year>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B40">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Semantic similarity in biomedical ontologies]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pesquita]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Faria]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Falcao]]></surname>
<given-names><![CDATA[A. O.]]></given-names>
</name>
<name>
<surname><![CDATA[Lord]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Couto]]></surname>
<given-names><![CDATA[F. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[PLoS comput biol]]></source>
<year>2009</year>
<volume>5</volume>
<numero>7</numero>
<issue>7</issue>
<page-range>e1000443</page-range></nlm-citation>
</ref>
<ref id="B41">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Posadas-Duran]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Markov]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Gomez-Adorno]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Batyrshin]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pichardo-Lagunas]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
</person-group>
<source><![CDATA[Syntactic n-grams as features for the author profiling task]]></source>
<year>2015</year>
<conf-name><![CDATA[ CLEF 2015 Evaluation Labs]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B42">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Development and application of a metric on semantic nets]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rada]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Mili]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Bicknell]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Blettner]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE transactions on systems, man, and cybernetics]]></source>
<year>1989</year>
<volume>19</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>17-30</page-range></nlm-citation>
</ref>
<ref id="B43">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ramage]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Rafferty]]></surname>
<given-names><![CDATA[A. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Random walks for text semantic similarity]]></source>
<year>2009</year>
<conf-name><![CDATA[ 2009 workshop on graph-based methods for natural language processing]]></conf-name>
<conf-loc> </conf-loc>
<page-range>23-31</page-range></nlm-citation>
</ref>
<ref id="B44">
<nlm-citation citation-type="">
<article-title xml:lang=""><![CDATA[Wordnet and distributional analysis: A class-based approach to lexical discovery]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Resnik]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[AAAI workshop on statistically-based natural language processing techniques]]></source>
<year>1992</year>
<page-range>56-64</page-range></nlm-citation>
</ref>
<ref id="B45">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Resnik]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Using information content to evaluate semantic similarity in a taxonomy]]></source>
<year>1995</year>
<publisher-name><![CDATA[arXiv]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B46">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Richardson]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Smeaton]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Using wordnet in a knowladge-based approach to information retrieval]]></source>
<year>1995</year>
<publisher-loc><![CDATA[Ireland ]]></publisher-loc>
<publisher-name><![CDATA[School of Computer Applications, Dublin Sity University]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B47">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rocchio]]></surname>
<given-names><![CDATA[J. J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Relevance feedback in information retrieval]]></source>
<year>1971</year>
<publisher-loc><![CDATA[Englewood Cliffs NJ ]]></publisher-loc>
<publisher-name><![CDATA[Prentice-Hall]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B48">
<nlm-citation citation-type="">
<article-title xml:lang=""><![CDATA[Term representation with generalized latent semantic analysis]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Royer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Recent Advances in Natural Language Processing IV: Selected papers from RANLP 2005]]></source>
<year>2007</year>
<volume>292</volume>
<page-range>45</page-range></nlm-citation>
</ref>
<ref id="B49">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rus]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Lintean]]></surname>
<given-names><![CDATA[M. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Banjade]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Niraula]]></surname>
<given-names><![CDATA[N. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Stefanescu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semilar: The semantic similarity toolkit]]></source>
<year>2013</year>
<conf-name><![CDATA[ ACL (Conference System Demonstrations)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>163-8</page-range></nlm-citation>
</ref>
<ref id="B50">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Automatic text structuring and summarization]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salton]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Singhal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mitra]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Buckley]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Information Processing &amp; Management]]></source>
<year>1997</year>
<volume>33</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>193-207</page-range></nlm-citation>
</ref>
<ref id="B51">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Efficient similarity-based operations for data integration]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schallehn]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Sattler]]></surname>
<given-names><![CDATA[K.-U.]]></given-names>
</name>
<name>
<surname><![CDATA[Saake]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data &amp; Knowledge Engineering]]></source>
<year>2004</year>
<volume>48</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>361-87</page-range><publisher-name><![CDATA[Elsevier]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B52">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sheldon]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[A first course in probability]]></source>
<year>2002</year>
<publisher-loc><![CDATA[India ]]></publisher-loc>
<publisher-name><![CDATA[Pearson Education]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B53">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Non-continuous syntactic n-grams [in Spanish, abstract and examples in English]]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Polibits]]></source>
<year>2013</year>
<volume>48</volume>
<page-range>67-75</page-range></nlm-citation>
</ref>
<ref id="B54">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Non-linear construction of n-grmas in computational lingusitics: Syntactic, filtered, and generalized n-grams]]></source>
<year>2013</year>
<publisher-loc><![CDATA[Mexico ]]></publisher-loc>
<publisher-name><![CDATA[SMIA]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B55">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Should syntactic n-grams contain names of syntactic relations?]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[International Journal of Computational Linguistics and Applications]]></source>
<year>2014</year>
<volume>5</volume>
<page-range>139-58</page-range></nlm-citation>
</ref>
<ref id="B56">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Soft similarity and soft cosine measure: Similarity of features in vector space model]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Gomez-Adorno]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computacion y Sistemas]]></source>
<year>2014</year>
<volume>18</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>491-504</page-range></nlm-citation>
</ref>
<ref id="B57">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Gomez-Adorno]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Markov]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Loya]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computing text similarity using tree edit distance]]></source>
<year>2015</year>
<conf-name><![CDATA[ Annual Conference of the North American Fuzzy Information processing Society and 5th World Conference on Soft Computing, NAFIPS '15]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-4</page-range></nlm-citation>
</ref>
<ref id="B58">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Syntactic n-grams as machine learning features for natural language processing]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Velasquez]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Stamatatos]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Chanona-Hernandez]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>2013</year>
<volume>41</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>853-60</page-range></nlm-citation>
</ref>
<ref id="B59">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Identification of common molecular subsequences]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smith]]></surname>
<given-names><![CDATA[T. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Waterman]]></surname>
<given-names><![CDATA[M. S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of molecular biology]]></source>
<year>1981</year>
<volume>147</volume>
<numero>1</numero>
<issue>1</issue>
</nlm-citation>
</ref>
<ref id="B60">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on danish commons]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sørensen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Biol. Skr.]]></source>
<year>1948</year>
<volume>5</volume>
<page-range>1-34</page-range></nlm-citation>
</ref>
<ref id="B61">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Steinberger]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Jezek]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Using latent semantic analysis in text summarization and summary evaluation]]></source>
<year>2004</year>
<conf-name><![CDATA[ Proc. ISIM'04]]></conf-name>
<conf-loc> </conf-loc>
<page-range>93-100</page-range></nlm-citation>
</ref>
<ref id="B62">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sussna]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Word sense disambiguation for free-text indexing using a massive semantic network]]></source>
<year>1993</year>
<conf-name><![CDATA[ second international conference on Information and knowledge management]]></conf-name>
<conf-loc> </conf-loc>
<page-range>67-74</page-range></nlm-citation>
</ref>
<ref id="B63">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tan]]></surname>
<given-names><![CDATA[P.-N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to data mining]]></source>
<year>2006</year>
<publisher-loc><![CDATA[India ]]></publisher-loc>
<publisher-name><![CDATA[Pearson Education]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B64">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Features of similarity]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tversky]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Psychological Review]]></source>
<year>1977</year>
<volume>84</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>327</page-range></nlm-citation>
</ref>
<ref id="B65">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Saric]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Glavas]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Karan]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Snajder]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Basic]]></surname>
<given-names><![CDATA[B. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Takelab: Systems for measuring semantic text similarity]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval '12]]></conf-name>
<conf-loc>Stroudsburg, PA, USA </conf-loc>
<page-range>441-8</page-range></nlm-citation>
</ref>
<ref id="B66">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A model of knowledge based information retrieval with hierarchical concept graph]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Whan Kim]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[J. H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of Documentation]]></source>
<year>1990</year>
<volume>46</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>113-36</page-range></nlm-citation>
</ref>
<ref id="B67">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Winkler]]></surname>
<given-names><![CDATA[W. E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Overview of record linkage and current research directions]]></source>
<year>2006</year>
<publisher-name><![CDATA[Bureau of the Census]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B68">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Exploring the similarity space]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zobel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Moffat]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM SIGIR Forum]]></source>
<year>1998</year>
<volume>32</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>18-34</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
