<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442015000100010</article-id>
<article-id pub-id-type="doi">10.17562/PB-51-9</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Soft Cardinality in Semantic Text Processing: Experience of the SemEval International Competitions]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[Sergio]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[Fabio A.]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[Alexander]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[Universidad Nacional de Colombia, Departamento de Ingeniería de Sistemas e Industrial]]></institution>
<addr-line><![CDATA[Bogotá ]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="A02">
<institution><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></institution>
<addr-line><![CDATA[México Distrito Federal]]></addr-line>
<country>México</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2015</year>
</pub-date>
<numero>51</numero>
<fpage>63</fpage>
<lpage>72</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442015000100010&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442015000100010&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442015000100010&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Soft cardinality is a generalization of the classic set cardinality (i.e., the number of elements in a set), which exploits similarities between elements to provide a "soft" count of the number of elements in a collection. This model is so general that it can be used interchangeably as the cardinality function in resemblance coefficients such as Jaccard's, Dice's, cosine, and others. Beyond that, cardinality-based features can be extracted from pairs of objects being compared to learn adaptive similarity functions from training data. This approach can be used for comparing any objects that can be represented as sets or bags. We and other international teams used soft cardinality to address a series of natural language processing (NLP) tasks in the recent SemEval (semantic evaluation) competitions from 2012 to 2014. The systems based on soft cardinality have always been among the best systems in all the tasks in which they participated. This paper describes our experience in that journey by presenting the generalities of the model and some practical techniques for using soft cardinality for NLP problems.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Similarity measure]]></kwd>
<kwd lng="en"><![CDATA[soft computing]]></kwd>
<kwd lng="en"><![CDATA[set cardinality]]></kwd>
<kwd lng="en"><![CDATA[semantics]]></kwd>
<kwd lng="en"><![CDATA[natural language processing]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="center"><font face="verdana" size="4"><b>Soft Cardinality in Semantic Text Processing: Experience of the SemEval International Competitions</b></font></p>     <p align="center"><font face="verdana" size="4">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Sergio Jimenez<sup>1</sup>, Fabio A. Gonzalez<sup>1</sup>, and Alexander Gelbukh<sup>2</sup></b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>1</sup> Departamento de Ingenier&iacute;a de Sistemas e Industrial of the Universidad Nacional de Colombia, Bogota, Colombia.</i> (e&#45;mail: <a href="mailto:fagonzalezo@unaledu.co">fagonzalezo@unaledu.co</a>, <a href="mailto:sergiojimenezvargas@gmail.com">sergiojimenezvargas@gmail.com</a>).</font></p>         <p align="justify"><font face="verdana" size="2"><i><sup>2</sup> Centro de Investigaci&oacute;n en Computaci&oacute;n, Instituto Polit&eacute;cnico Nacional, M&eacute;xico City, M&eacute;xico.</i> (e&#45;mail: <a href="mailto:gelbukh@gelbukh.com">gelbukh@gelbukh.com</a>).</font></p>         <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>         <p align="justify"><font face="verdana" size="2">Manuscript received on February 17, 2015.    <br> Accepted for publication on May 27, 2015.     ]]></body>
<body><![CDATA[<br> Published on June 15, 2015.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">Soft cardinality is a generalization of the classic set cardinality (i.e., the number of elements in a set), which exploits similarities between elements to provide a "soft" count of the number of elements in a collection. This model is so general that it can be used interchangeably as the cardinality function in resemblance coefficients such as Jaccard's, Dice's, cosine, and others. Beyond that, cardinality&#45;based features can be extracted from pairs of objects being compared to learn adaptive similarity functions from training data. This approach can be used for comparing any objects that can be represented as sets or bags. We and other international teams used soft cardinality to address a series of natural language processing (NLP) tasks in the recent SemEval (semantic evaluation) competitions from 2012 to 2014. The systems based on soft cardinality have always been among the best systems in all the tasks in which they participated. 
This paper describes our experience in that journey by presenting the generalities of the model and some practical techniques for using soft cardinality for NLP problems.</font></p>      <p align="justify"><font face="verdana" size="2"><b>Key words: </b>Similarity measure, soft computing, set cardinality, semantics, natural language processing.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n51/n51a10.pdf" target="_blank">DOWNLOAD ARTICLE IN PDF FORMAT</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>ACKNOWLEDGMENT</b></font></p>  	    <p align="justify"><font face="verdana" size="2">The second author acknowledges the support of LACCIR R1212LAC006 under the project "Multimodal image retrieval to support medical case&#45;based scientific literature search." The third author acknowledges the support of the Mexican Government via SNI, CONACYT, and the Instituto Polit&eacute;cnico Nacional, SIP&#45;IPN grants 20152100 and 20152095.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>      <p align="justify"><font face="verdana" size="2"><b>REFERENCES</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; S. Jimenez, F. Gonzalez, and A. Gelbukh, "Text Comparison Using Soft Cardinality," in <i>String Processing and Information Retrieval,</i> ser. LNCS, E. Chavez and S. Lonardi, Eds. Berlin, Heidelberg: Springer, 2010, vol. 6393, pp. 297&#45;302.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059898&pid=S1870-9044201500010001000001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; S. P. Jena, S. K. Ghosh, and B. K. Tripathy, "On the theory of bags and lists," <i>Information Sciences,</i> vol. 132, no. 1&#45;4, pp. 241&#45;254, 2001.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059900&pid=S1870-9044201500010001000002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;3&#93; P. Jaccard, "&Eacute;tude comparative de la distribution florale dans une portion des Alpes et des Jura," <i>Bulletin de la Soci&eacute;t&eacute; Vaudoise des Sciences Naturelles,</i> pp. 547&#45;579, 1901.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059902&pid=S1870-9044201500010001000003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;4&#93; L. R. Dice, "Measures of the Amount of Ecologic Association Between Species," <i>Ecology,</i> vol. 26, no. 3, pp. 297&#45;302, 1945.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059904&pid=S1870-9044201500010001000004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;5&#93; A. Tversky, "Features of similarity," <i>Psychological Review,</i> vol. 84, no. 4, pp. 327&#45;352, 1977.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059906&pid=S1870-9044201500010001000005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; Ochiai, Akira, "Zoogeographical studies on the soleoid fishes found Japan and its neighboring regions," <i>Jap. Soc. Sci. Fish.,</i> vol. 22, no. 9, pp. 526&#45;530, 1957.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059908&pid=S1870-9044201500010001000006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; G. Sidorov, A. Gelbukh, H. Gomez&#45;Adorno, and D. Pinto, "Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model," <i>Computacion y Sistemas,</i> vol. 18, no. 3, pp. 491&#45;504, 2014.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059910&pid=S1870-9044201500010001000007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; S. Jimenez, C. Becerra, and A. 
Gelbukh, "SOFTCARDINALITY&#45;CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity," in <i>Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task.</i> Atlanta, Georgia, USA: ACL, Jun. 2013, pp. 194&#45;201.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059912&pid=S1870-9044201500010001000008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;9&#93; B. D. Baets, H. D. Meyer, and H. Naessens, "A class of rational cardinality&#45;based similarity measures," <i>Journal ofComputational and Applied Mathematics,</i> vol. 132, no. 1, pp. 51&#45;69, Jul. 2001.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059914&pid=S1870-9044201500010001000009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; R. Poli, W. B. Langdon, N. F. McPhee, and J. R. Koza, <i>Afield guide to genetic programming.</i> Lulu. com, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059916&pid=S1870-9044201500010001000010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," <i>The Journal of Machine Learning Research,</i> vol. 3, pp. 1157&#45;1182, 2003.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059918&pid=S1870-9044201500010001000011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; Jimenez, Sergio, Gonzalez, Fabio A., and Gelbukh, Alexander, "Cardinality&#45;based lexical similarity in WordNet: Bridging the gap to neural embedding," <i>to appear,</i> 2015.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059920&pid=S1870-9044201500010001000012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;13&#93; Due&ntilde;as, George, Jimenez, Sergio, and Julia, Baquero, "Automatic prediction of item difficulty for short&#45;answer questions," in <i>to appear,</i> 2015.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059922&pid=S1870-9044201500010001000013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; Bouma, Gerlof, "Normalized (pointwise) mutual information in collocation extraction," in <i>Proceedings of the Biennial GSCL Conference,</i> 2009, pp. 31&#45;40.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059924&pid=S1870-9044201500010001000014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; T. Pedersen, S. Patwardhan, and J. Michelizzi, "WordNet::Similarity: measuring the relatedness of concepts," in <i>Proceedings HLT&#45;NAACL&#45;Demonstration Papers.</i> Stroudsburg, PA, USA: ACL, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059926&pid=S1870-9044201500010001000015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; S. Jimenez, C. Becerra, and A. Gelbukh, "Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross&#45;lingual Textual Entailment," in <i>First Joint Conference on Lexical and Computational Semantics (*SEM).</i> Montreal, Canada: ACL, 2012, pp. 684&#45;688.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059928&pid=S1870-9044201500010001000016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; A. E. Monge and C. Elkan, "The field matching problem: Algorithms and applications," in <i>Proceeding ofthe 2nd International Conference on Knowledge Discovery and Data Mining (KDD&#45;96),</i> Portland, OR, 1996, pp. 267&#45;270.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059930&pid=S1870-9044201500010001000017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; S. Jimenez, C. 
Becerra, A. Gelbukh, and F. Gonzalez, "Generalized Mongue&#45;Elkan Method for Approximate Text String Comparison," in <i>Computational Linguistics and Intelligent Text Processing,</i> ser. Lecture Notes in Computer Science, A. Gelbukh, Ed. Springer, Jan. 2009, no. 5449, pp. 559&#45;570.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059932&pid=S1870-9044201500010001000018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; G. Salton, <i>Introduction to modern information retrieval.</i> McGraw&#45;Hill, 1983.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059934&pid=S1870-9044201500010001000019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; S. Robertson, S. Walker, S. Jones, M. M. Hancock&#45;Beaulieu, and M. Gatford, "Okapi at TREC&#45;3," in <i>Proceedings of the Third Text REtrieval Conference (TREC 1994),</i> Gaithersburg, USA, 1994, pp. 109&#45;126.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059936&pid=S1870-9044201500010001000020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;21&#93; Jimenez, Sergio, Gonzalez, Fabio A., and Gelbukh, Alexander, "Mathematical properties of Soft Cardinality: Enhancing Jaccard, Dice and cosine similarity measures with element&#45;wise distance," <i>to appear,</i> 2015.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059938&pid=S1870-9044201500010001000021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;22&#93; V. I. Levenshtein, "Binary codes capable of correcting deletions, insertions, and reversals," <i>Soviet Physics Doklady,</i> vol. 10, no. 8, pp. 707&#45;710, 1966.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059940&pid=S1870-9044201500010001000022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;23&#93; W. E. 
Winkler, "The State of Record Linkage and Current Research Problems," <i>Statistical Research Division, US Census Bureau,</i> 1999.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059942&pid=S1870-9044201500010001000023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;24&#93; A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios, "Duplicate Record Detection: A Survey," <i>IEEE Trans. on Knowl. and Data Eng.,</i> vol. 19, no. 1, pp. 1&#45;16, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059944&pid=S1870-9044201500010001000024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;25&#93; M. Marelli, L. Bentivogli, M. Baroni, R. Bernardi, S. Menini, and R. Zamparelli, "Semeval&#45;2014 task 1: Evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment," in <i>Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014).</i> Dublin, Ireland: ACL, 2014, pp. 1&#45;8.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059946&pid=S1870-9044201500010001000025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;26&#93; B. T. McInnes, T. Pedersen, Y. Liu, G. B. Melton, and S. V. Pakhomov, "U&#45;path: An undirected path&#45;based measure of semantic similarity," in <i>AMIA Annual Symposium Proceedings,</i> vol. 2014. American Medical Informatics Association, 2014, p. 882.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059948&pid=S1870-9044201500010001000026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;27&#93; E. Agirre, E. Alfonseca, K. Hall, J. Kravalova, M. Pasca, and A. Soroa, "A study on similarity and relatedness using distributional and WordNet&#45;based approaches," in <i>Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics,</i> ser. NAACL'09. Stroudsburg, PA, USA: ACL, 2009, pp. 19&#45;27.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059950&pid=S1870-9044201500010001000027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;28&#93; Mikolov, Tomas, Sutskever, Ilya, Chen, Kai, Corrado, Greg, and Dean, Jeff, "Distributed representations of words and phrases and their compositionality," in <i>Advances in Neural Information Processing Systems (NIPS),</i> 2013, pp. 3111&#45;3119.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059952&pid=S1870-9044201500010001000028&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;29&#93; Pennington, Jeffrey, Socher, Richard, and Manning, Christopher D., "Glove: Global vectors for word representation," in <i>Proceedings ofthe Empiricial Methods in Natural Language Processing (EMNLP 2014),</i> vol. 12, Doha, Qatar, 2014, pp. 1532&#45;1543.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059954&pid=S1870-9044201500010001000029&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;30&#93; E. Gabrilovich and S. Markovitch, "Computing Semantic Relatedness Using Wikipedia&#45;based Explicit Semantic Analysis," in <i>Proceedings of the 20th International Joint Conference on Artifical Intelligence,</i> ser. IJCAI'07. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2007, pp. 1606&#45;1611.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059956&pid=S1870-9044201500010001000030&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;31&#93; S. Banerjee and T. Pedersen, "An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet," in <i>Computational Linguistics and Intelligent Text Processing,</i> ser. Lecture Notes in Computer Science, A. Gelbukh, Ed. Springer, 2002, no. 2276, pp. 136&#45;145.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059958&pid=S1870-9044201500010001000031&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;32&#93; C. Corley and R. Mihalcea, "Measuring the semantic similarity of texts," in <i>Proceedings ofthe ACL Workshop on Empirical Modeling ofSemantic Equivalence and Entailment,</i> ser. EMSEE'05. Stroudsburg, PA, USA: Association for Computational Linguistics, 2005, pp. 13&#45;18.    
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059960&pid=S1870-9044201500010001000032&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;33&#93; D. Croce, V. Storch, P. Annesi, and R. Basili, "Distributional Compositional Semantics and Text Similarity," in <i>Proceedings of the IEEE Sixth International Conference on Semantic Computing (ICSC),</i> SEP. 2012, pp. 242&#45;249.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059962&pid=S1870-9044201500010001000033&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;34&#93; D. Croce, V. Storch, and R. Basili, "UNITOR&#45;CORE TYPED: Combining Text Similarity and Semantic Filters through SV Regression," in <i>Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings ofthe Main Conference and the Shared Task: SemanticTextual Similarity.</i> Atlanta, Georgia, USA: ACL, 2013, pp. 59&#45;65.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059964&pid=S1870-9044201500010001000034&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;35&#93; M.&#45;C. De Marneffe, B. MacCartney, C. D. Manning, and others, "Generating typed dependency parses from phrase structure parses," in <i>proceedings of LREC,</i> vol. 6, 2006, pp. 449&#45;454.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059966&pid=S1870-9044201500010001000035&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"> &#91;36&#93; M. D. Lee, B. Pincombe, and M. Welsh, "An empirical evaluation of models of text document similarity," in <i>In CogSci2005.</i> Erlbaum, 2005, pp. 1254&#45;1259.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059968&pid=S1870-9044201500010001000036&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;37&#93; E. Agirre, D. Cer, M. Diab, and G.&#45;A. Aitor, "SemEval&#45;2012 Task 6: A Pilot on Semantic Textual Similarity," in <i>First Joint Conference on Lexical and Computational Semantics (*SEM).</i> Montreal, Canada: ACL, 2012, pp. 385&#45;393.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059970&pid=S1870-9044201500010001000037&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;38&#93; E. Agirre, C. Banea, C. Cardie, D. Cer, M. Diab, A. Gonzalez&#45;Aguirre, W. Guo, R. Mihalcea, G. Rigau, and J. 
Wiebe, "SemEval&#45;2014 Task 10: Multilingual semantic textual similarity," in <i>Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014).</i> Dublin, Ireland: ACL, 2014, pp. 81&#45;91.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059972&pid=S1870-9044201500010001000038&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;39&#93; A. Lynum, P. Pakray, B. Gamback, and S. Jimenez, "NTNU: Measuring Semantic Similarity with Sublexical Feature Representations and Soft Cardinality," in <i>Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014).</i> Dublin, Ireland: ACL, 2014, pp. 448&#45;453.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6059974&pid=S1870-9044201500010001000039&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;40&#93; S. Jimenez, G. Duenas, J. Baquero, and A. Gelbukh, "UNAL&#45;NLP: Combining soft cardinality features for semantic textual similarity, relatedness and entailment," in <i>Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014).</i> Dublin, Ireland: ACL, 2014, pp. 732&#45;742.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;41&#93; D. Jurgens, M. T. Pilehvar, and R. Navigli, "SemEval&#45;2014 Task 3: Cross&#45;level semantic similarity," in <i>Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014).</i> Dublin, Ireland: ACL, 2014, pp. 17&#45;26.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;42&#93; M. Negri, A. Marchetti, Y. Mehdad, L. Bentivogli, and D. Giampiccolo, "Semeval&#45;2012 Task 8: Cross&#45;lingual Textual Entailment for Content Synchronization," in <i>First Joint Conference on Lexical and Computational Semantics (*SEM).</i> Montreal, Canada: ACL, 2012, pp. 399&#45;407.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;43&#93; M. Negri, A. Marchetti, Y. Mehdad, and L. Bentivogli, "Semeval&#45;2013 Task 8: Cross&#45;lingual Textual Entailment for Content Synchronization," in <i>Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval 2013).</i> Atlanta, Georgia, USA: ACL, 2013, pp. 25&#45;33.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;44&#93; S. Jimenez, C. Becerra, and A. Gelbukh, "SOFTCARDINALITY: Hierarchical Text Overlap for Student Response Analysis," in <i>Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013).</i> Atlanta, Georgia, USA: ACL, 2013, pp. 280&#45;284.<!-- end-ref --></font></p>
]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;45&#93; &#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;, "SOFTCARDINALITY: Learning to Identify Directional Cross&#45;Lingual Entailment from Cardinalities and SMT," in <i>Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013).</i> Atlanta, Georgia, USA: ACL, Jun. 2013, pp. 34&#45;38.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;46&#93; &#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;&#45;, "Soft Cardinality: A Parameterized Similarity Function for Text Comparison," in <i>First Joint Conference on Lexical and Computational Semantics (*SEM).</i> Montreal, Canada: ACL, 2012, pp. 449&#45;453.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;47&#93; M. O. Dzikovska, R. D. Nielsen, C. Brew, C. Leacock, D. Giampiccolo, L. Bentivogli, P. Clark, I. Dagan, and H. T. Dang, "SemEval&#45;2013 Task 7: The Joint Student Response Analysis and 8th Recognizing Textual Entailment Challenge," in <i>Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval 2013), in conjunction with the Second Joint Conference on Lexical and Computational Semantics (*SEM 2013).</i> Atlanta, Georgia, USA: ACL, 2013, pp. 263&#45;274.<!-- end-ref --></font></p>
<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;48&#93; S. P. Leeman&#45;Munk, E. N. Wiebe, and J. C. Lester, "Assessing elementary students' science competency with text analytics," in <i>Proceedings of the Fourth International Conference on Learning Analytics And Knowledge (LAK 14).</i> Indianapolis, Indiana, USA: ACM, 2014, pp. 143&#45;147.<!-- end-ref --></font></p>
]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text Comparison Using Soft Cardinality]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Chavez]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Lonardi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[String Processing and Information Retrieval]]></source>
<year>2010</year>
<volume>6393</volume>
<page-range>297-302</page-range><publisher-loc><![CDATA[Berlin, Heidelberg]]></publisher-loc>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jena]]></surname>
<given-names><![CDATA[S. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Ghosh]]></surname>
<given-names><![CDATA[S. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Tripathy]]></surname>
<given-names><![CDATA[B. K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[On the theory of bags and lists]]></article-title>
<source><![CDATA[Information Sciences]]></source>
<year>2001</year>
<volume>132</volume>
<numero>1-4</numero>
<issue>1-4</issue>
<page-range>241-254</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jaccard]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="fr"><![CDATA[Étude comparative de la distribution florale dans une portion des Alpes et des Jura]]></article-title>
<source><![CDATA[Bulletin de la Société Vaudoise des Sciences Naturelles]]></source>
<year>1901</year>
<page-range>547-579</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dice]]></surname>
<given-names><![CDATA[L. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Measures of the Amount of Ecologic Association Between Species]]></article-title>
<source><![CDATA[Ecology]]></source>
<year>1945</year>
<volume>26</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>297-302</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tversky]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Features of similarity]]></article-title>
<source><![CDATA[Psychological Review]]></source>
<year>1977</year>
<volume>84</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>327-352</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ochiai]]></surname>
<given-names><![CDATA[Akira]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Zoogeographical studies on the soleoid fishes found in Japan and its neighboring regions]]></article-title>
<source><![CDATA[Jap. Soc. Sci. Fish.]]></source>
<year>1957</year>
<volume>22</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>526-530</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gomez-Adorno]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Pinto]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model]]></article-title>
<source><![CDATA[Computación y Sistemas]]></source>
<year>2014</year>
<volume>18</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>491-504</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Becerra]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity]]></article-title>
<source><![CDATA[Second Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2013</year>
<month>06</month>
<volume>1</volume>
<page-range>194-201</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[De Baets]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[De Meyer]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Naessens]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A class of rational cardinality-based similarity measures]]></article-title>
<source><![CDATA[Journal of Computational and Applied Mathematics]]></source>
<year>2001</year>
<month>07</month>
<volume>132</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>51-69</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Poli]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Langdon]]></surname>
<given-names><![CDATA[W. B.]]></given-names>
</name>
<name>
<surname><![CDATA[McPhee]]></surname>
<given-names><![CDATA[N. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Koza]]></surname>
<given-names><![CDATA[J. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[A field guide to genetic programming]]></source>
<year>2008</year>
<publisher-name><![CDATA[Lulu.com]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Guyon]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Elisseeff]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An introduction to variable and feature selection]]></article-title>
<source><![CDATA[The Journal of Machine Learning Research]]></source>
<year>2003</year>
<volume>3</volume>
<page-range>1157-1182</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[Sergio]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[Fabio A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[Alexander]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Cardinality-based lexical similarity in WordNet: Bridging the gap to neural embedding]]></article-title>
<source><![CDATA[to appear]]></source>
<year>2015</year>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dueñas]]></surname>
<given-names><![CDATA[George]]></given-names>
</name>
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[Sergio]]></given-names>
</name>
<name>
<surname><![CDATA[Baquero]]></surname>
<given-names><![CDATA[Julia]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic prediction of item difficulty for short-answer questions]]></article-title>
<source><![CDATA[to appear]]></source>
<year>2015</year>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bouma]]></surname>
<given-names><![CDATA[Gerlof]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Normalized (pointwise) mutual information in collocation extraction]]></article-title>
<source><![CDATA[Proceedings of the Biennial GSCL Conference]]></source>
<year>2009</year>
<page-range>31-40</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pedersen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Patwardhan]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Michelizzi]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[WordNet::Similarity: measuring the relatedness of concepts]]></article-title>
<source><![CDATA[Proceedings HLT-NAACL-Demonstration Papers]]></source>
<year>2004</year>
<publisher-loc><![CDATA[Stroudsburg, PA]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Becerra]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Soft Cardinality + ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment]]></article-title>
<source><![CDATA[First Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2012</year>
<page-range>684-688</page-range><publisher-loc><![CDATA[Montreal ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Monge]]></surname>
<given-names><![CDATA[A. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Elkan]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The field matching problem: Algorithms and applications]]></article-title>
<source><![CDATA[Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96)]]></source>
<year>1996</year>
<page-range>267-270</page-range><publisher-loc><![CDATA[Portland, OR]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Becerra]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Generalized Mongue-Elkan Method for Approximate Text String Comparison]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational Linguistics and Intelligent Text Processing]]></source>
<year>2009</year>
<month>01</month>
<page-range>559-570</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salton]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to modern information retrieval]]></source>
<year>1983</year>
<publisher-name><![CDATA[McGraw-Hill]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Robertson]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Walker]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jones]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Hancock-Beaulieu]]></surname>
<given-names><![CDATA[M. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gatford]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Okapi at TREC-3]]></article-title>
<source><![CDATA[Proceedings of the Third Text REtrieval Conference (TREC 1994)]]></source>
<year>1994</year>
<page-range>109-126</page-range><publisher-loc><![CDATA[Gaithersburg ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[Sergio]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez]]></surname>
<given-names><![CDATA[Fabio A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[Alexander]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Mathematical properties of Soft Cardinality: Enhancing Jaccard, Dice and cosine similarity measures with element-wise distance]]></article-title>
<source><![CDATA[to appear]]></source>
<year>2015</year>
</nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Levenshtein]]></surname>
<given-names><![CDATA[V. I.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Binary codes capable of correcting deletions, insertions, and reversals]]></article-title>
<source><![CDATA[Soviet Physics Doklady]]></source>
<year>1966</year>
<volume>10</volume>
<numero>8</numero>
<issue>8</issue>
<page-range>707-710</page-range></nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Winkler]]></surname>
<given-names><![CDATA[W. E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The State of Record Linkage and Current Research Problems]]></article-title>
<source><![CDATA[Statistical Research Division]]></source>
<year>1999</year>
<publisher-name><![CDATA[Census Bureau]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Elmagarmid]]></surname>
<given-names><![CDATA[A. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Ipeirotis]]></surname>
<given-names><![CDATA[P. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Verykios]]></surname>
<given-names><![CDATA[V. S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Duplicate Record Detection: A Survey]]></article-title>
<source><![CDATA[IEEE Trans. on Knowl. and Data Eng.]]></source>
<year>2007</year>
<volume>19</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-16</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Marelli]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bentivogli]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Baroni]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bernardi]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Menini]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Zamparelli]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Semeval-2014 task 1: Evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment]]></article-title>
<source><![CDATA[Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)]]></source>
<year>2014</year>
<page-range>1-8</page-range><publisher-loc><![CDATA[Dublin, Ireland]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McInnes]]></surname>
<given-names><![CDATA[B. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Pedersen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Melton]]></surname>
<given-names><![CDATA[G. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Pakhomov]]></surname>
<given-names><![CDATA[S. V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[U-path: An undirected path-based measure of semantic similarity]]></article-title>
<source><![CDATA[AMIA Annual Symposium Proceedings]]></source>
<year>2014</year>
<volume>2014</volume>
<page-range>882</page-range><publisher-name><![CDATA[American Medical Informatics Association]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Alfonseca]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Hall]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Kravalova]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Pasca]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Soroa]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A study on similarity and relatedness using distributional and WordNet-based approaches]]></article-title>
<source><![CDATA[Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics]]></source>
<year>2009</year>
<page-range>19-27</page-range><publisher-loc><![CDATA[Stroudsburg, PA]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mikolov]]></surname>
<given-names><![CDATA[Tomas]]></given-names>
</name>
<name>
<surname><![CDATA[Sutskever]]></surname>
<given-names><![CDATA[Ilya]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[Kai]]></given-names>
</name>
<name>
<surname><![CDATA[Corrado]]></surname>
<given-names><![CDATA[Greg]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[Jeff]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Distributed representations of words and phrases and their compositionality]]></article-title>
<source><![CDATA[Advances in Neural Information Processing Systems (NIPS)]]></source>
<year>2013</year>
<page-range>3111-3119</page-range></nlm-citation>
</ref>
<ref id="B29">
<label>29</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pennington]]></surname>
<given-names><![CDATA[Jeffrey]]></given-names>
</name>
<name>
<surname><![CDATA[Socher]]></surname>
<given-names><![CDATA[Richard]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[Christopher D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Glove: Global vectors for word representation]]></article-title>
<source><![CDATA[Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2014)]]></source>
<year>2014</year>
<volume>12</volume>
<page-range>1532-1543</page-range><publisher-loc><![CDATA[Doha ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B30">
<label>30</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gabrilovich]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Markovitch]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis]]></article-title>
<source><![CDATA[Proceedings of the 20th International Joint Conference on Artificial Intelligence]]></source>
<year>2007</year>
<page-range>1606-1611</page-range><publisher-loc><![CDATA[San Francisco, CA]]></publisher-loc>
<publisher-name><![CDATA[Morgan Kaufmann Publishers Inc.]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B31">
<label>31</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Banerjee]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Pedersen]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computational Linguistics and Intelligent Text Processing]]></source>
<year>2002</year>
<page-range>136-145</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B32">
<label>32</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Corley]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Measuring the semantic similarity of texts]]></article-title>
<source><![CDATA[Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment]]></source>
<year>2005</year>
<page-range>13-18</page-range><publisher-loc><![CDATA[Stroudsburg, PA]]></publisher-loc>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B33">
<label>33</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Croce]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Storch]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Annesi]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Basili]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Distributional Compositional Semantics and Text Similarity]]></article-title>
<source><![CDATA[Proceedings of the IEEE Sixth International Conference on Semantic Computing (ICSC)]]></source>
<year>2012</year>
<month>09</month>
<page-range>242-249</page-range></nlm-citation>
</ref>
<ref id="B34">
<label>34</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Croce]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Storch]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Basili]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[UNITOR-CORE TYPED: Combining Text Similarity and Semantic Filters through SV Regression]]></article-title>
<source><![CDATA[Second Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2013</year>
<volume>1</volume>
<page-range>59-65</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B35">
<label>35</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[de Marneffe]]></surname>
<given-names><![CDATA[M.-C.]]></given-names>
</name>
<name>
<surname><![CDATA[MacCartney]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Generating typed dependency parses from phrase structure parses]]></article-title>
<source><![CDATA[LREC]]></source>
<year>2006</year>
<volume>6</volume>
<page-range>449-454</page-range></nlm-citation>
</ref>
<ref id="B36">
<label>36</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[M. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Pincombe]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Welsh]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An empirical evaluation of models of text document similarity]]></article-title>
<source><![CDATA[CogSci]]></source>
<year>2005</year>
<page-range>1254-1259</page-range><publisher-name><![CDATA[Erlbaum]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B37">
<label>37</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Cer]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Agirre]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity]]></article-title>
<source><![CDATA[First Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2012</year>
<page-range>385-393</page-range><publisher-loc><![CDATA[Montreal ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B38">
<label>38</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agirre]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Banea]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cardie]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cer]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Diab]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Agirre]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Guo]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Rigau]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiebe]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2014 Task 10: Multilingual semantic textual similarity]]></article-title>
<source><![CDATA[Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)]]></source>
<year>2014</year>
<page-range>81-91</page-range><publisher-loc><![CDATA[Dublin ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B39">
<label>39</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lynum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pakray]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Gambäck]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[NTNU: Measuring Semantic Similarity with Sublexical Feature Representations and Soft Cardinality]]></article-title>
<source><![CDATA[Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)]]></source>
<year>2014</year>
<page-range>448-453</page-range><publisher-loc><![CDATA[Dublin ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B40">
<label>40</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Duenas]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Baquero]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[UNAL-NLP: Combining soft cardinality features for semantic textual similarity, relatedness and entailment]]></article-title>
<source><![CDATA[Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)]]></source>
<year>2014</year>
<page-range>732-742</page-range><publisher-loc><![CDATA[Dublin ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B41">
<label>41</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jurgens]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Pilehvar]]></surname>
<given-names><![CDATA[M. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Navigli]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2014 Task 3: Cross-level semantic similarity]]></article-title>
<source><![CDATA[Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)]]></source>
<year>2014</year>
<page-range>17-26</page-range><publisher-loc><![CDATA[Dublin ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B42">
<label>42</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Negri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Marchetti]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mehdad]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Bentivogli]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Giampiccolo]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2012 Task 8: Cross-lingual Textual Entailment for Content Synchronization]]></article-title>
<source><![CDATA[First Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2012</year>
<page-range>399-407</page-range><publisher-loc><![CDATA[Montreal ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B43">
<label>43</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Negri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Marchetti]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mehdad]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Bentivogli]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2013 Task 8: Cross-lingual Textual Entailment for Content Synchronization]]></article-title>
<source><![CDATA[Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval 2013)]]></source>
<year>2013</year>
<page-range>25-33</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B44">
<label>44</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Becerra]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SOFTCARDINALITY: Hierarchical Text Overlap for Student Response Analysis]]></article-title>
<source><![CDATA[Second Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2013</year>
<volume>2</volume>
<page-range>280-284</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B45">
<label>45</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT]]></article-title>
<source><![CDATA[Second Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2013</year>
<month>06</month>
<volume>2</volume>
<page-range>34-38</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B46">
<label>46</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jimenez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Soft Cardinality: A Parameterized Similarity Function for Text Comparison]]></article-title>
<source><![CDATA[First Joint Conference on Lexical and Computational Semantics (*SEM)]]></source>
<year>2012</year>
<page-range>449-453</page-range><publisher-loc><![CDATA[Montreal ]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B47">
<label>47</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dzikovska]]></surname>
<given-names><![CDATA[M. O.]]></given-names>
</name>
<name>
<surname><![CDATA[Nielsen]]></surname>
<given-names><![CDATA[R. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Brew]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Leacock]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Giampiccolo]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Bentivogli]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Clark]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Dagan]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Dang]]></surname>
<given-names><![CDATA[H. T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[SemEval-2013 Task 7: The Joint Student Response Analysis and 8th Recognizing Textual Entailment Challenge]]></article-title>
<source><![CDATA[Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval 2013), in conjunction with the Second Joint Conference on Lexical and Computational Semantics (*SEM 2013)]]></source>
<year>2013</year>
<page-range>263-274</page-range><publisher-loc><![CDATA[Atlanta, Georgia]]></publisher-loc>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B48">
<label>48</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Leeman-Munk]]></surname>
<given-names><![CDATA[S. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiebe]]></surname>
<given-names><![CDATA[E. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Lester]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Assessing elementary students' science competency with text analytics]]></article-title>
<source><![CDATA[Proceedings of the Fourth International Conference on Learning Analytics And Knowledge (LAK 14)]]></source>
<year>2014</year>
<page-range>143-147</page-range><publisher-loc><![CDATA[Indianapolis, Indiana]]></publisher-loc>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
