<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462013000200009</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[A Knowledge-Base Oriented Approach for Automatic Keyword Extraction]]></article-title>
<article-title xml:lang="es"><![CDATA[El enfoque basado en conocimiento para la extracción automática de palabras clave]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Jean-Louis]]></surname>
<given-names><![CDATA[Ludovic]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gagnon]]></surname>
<given-names><![CDATA[Michel]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Charton]]></surname>
<given-names><![CDATA[Eric]]></given-names>
</name>
<xref ref-type="aff" rid="A03"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,École Polytechnique de Montréal  ]]></institution>
<addr-line><![CDATA[Montréal QC]]></addr-line>
<country>Canada</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Centre de Recherche Informatique de Montréal  ]]></institution>
<addr-line><![CDATA[Montréal QC]]></addr-line>
<country>Canada</country>
</aff>
<aff id="A03">
<institution><![CDATA[,École Polytechnique de Montréal  ]]></institution>
<addr-line><![CDATA[Montréal QC]]></addr-line>
<country>Canada</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2013</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2013</year>
</pub-date>
<volume>17</volume>
<numero>2</numero>
<fpage>187</fpage>
<lpage>196</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462013000200009&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462013000200009&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462013000200009&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Automatic keyword extraction is an important subfield of information extraction process. It is a difficult task, where numerous different techniques and resources have been proposed. In this paper, we propose a generic approach to extract keyword from documents using encyclopedic knowledge. Our two-step approach first relies on a classification step for identifying candidate keywords followed by a learning-to-rank method depending on a user-defined keyword profile to order the candidates. The novelty of our approach relies on i) the usage of the keyword profile ii) generic features derived from Wikipedia categories and not necessarily related to the document content. We evaluate our system on keyword datasets and corpora from standard evaluation campaign and show that our system improves the global process of keyword extraction.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[Extracción de palabras clave es una tarea importante del proceso de extracción de información. Esta tarea es difícil de realizar; con la intención de lograrlo muchas distintas técnicas y recursos han sido propuestos. En este artículo se propone el enfoque genérico para extraer palabras clave de documentos usando el conocimiento enciclopédico. El enfoque incluye dos etapas; primero se realiza clasificación con el fin de identificar candidatos a palabras clave y luego se aplica el método de aprendizaje de ranking dependiente del perfil de palabras clave definido por el usuario para ordenar los candidatos. La novedad del enfoque se basa en 1) el uso del perfil de palabras clave y 2) las características genéricas derivadas de las categorías de Wikipedia y no necesariamente relacionadas con el contenido del documento. El sistema se ha evaluado sobre conjuntos de datos de palabras clave y corpus de la campaña de evaluación estándar y se ha demostrado que el sistema propuesto mejora el procedimiento global de extracción de palabras clave.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Automatic keyword extraction]]></kwd>
<kwd lng="en"><![CDATA[encyclopedic knowledge]]></kwd>
<kwd lng="es"><![CDATA[Extracción automática de palabras clave]]></kwd>
<kwd lng="es"><![CDATA[conocimiento enciclopédico]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="4"><b>A Knowledge&#45;Base Oriented Approach for Automatic Keyword Extraction</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="3"><b>El enfoque basado en conocimiento para la extracci&oacute;n autom&aacute;tica de palabras clave</b></font></p>  	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Ludovic Jean&#45;Louis<sup>1</sup>, Michel Gagnon<sup>1</sup>, and Eric Charton<sup>3</sup></b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><sup><i>1</i></sup> <i>&Eacute;cole Polytechnique de Montr&eacute;al, Montr&eacute;al, QC, Canada</i> <a href="mailto:ludovic.jean&#45;louis@polymtl.ca">ludovic.jean&#45;louis@polymtl.ca</a></font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><sup><i>2</i></sup> <i>Centre de Recherche Informatique de Montr&eacute;al, Montr&eacute;al, QC, Canada</i> <a href="mailto:michel.gagnon@polymtl.ca">michel.gagnon@polymtl.ca</a></font></p>  	    <p align="justify"><font face="verdana" size="2"><sup><i>3</i></sup> <i>&Eacute;cole Polytechnique de Montr&eacute;al, Montr&eacute;al, QC, Canada and Centre de Recherche Informatique de Montr&eacute;al, Montr&eacute;al, QC, Canada</i> <a href="mailto:eric.charton@crim.ca">eric.charton@crim.ca</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2">Article received on 08/12/2012    <br> 	Accepted on 17/01/2013.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Automatic keyword extraction is an important subfield of information extraction process. It is a difficult task, where numerous different techniques and resources have been proposed. In this paper, we propose a generic approach to extract keyword from documents using encyclopedic knowledge. Our two&#45;step approach first relies on a classification step for identifying candidate keywords followed by a learning&#45;to&#45;rank method depending on a user&#45;defined keyword profile to order the candidates. The novelty of our approach relies on i) the usage of the keyword profile ii) generic features derived from Wikipedia categories and not necessarily related to the document content. We evaluate our system on keyword datasets and corpora from standard evaluation campaign and show that our system improves the global process of keyword extraction.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Automatic keyword extraction, encyclopedic knowledge.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>  	    <p align="justify"><font face="verdana" size="2">Extracci&oacute;n de palabras clave es una tarea importante del proceso de extracci&oacute;n de informaci&oacute;n. Esta tarea es dif&iacute;cil de realizar; con la intenci&oacute;n de lograrlo muchas distintas t&eacute;cnicas y recursos han sido propuestos. En este art&iacute;culo se propone el enfoque gen&eacute;rico para extraer palabras clave de documentos usando el conocimiento enciclop&eacute;dico. El enfoque incluye dos etapas; primero se realiza clasificaci&oacute;n con el fin de identificar candidatos a palabras clave y luego se aplica el m&eacute;todo de aprendizaje de ranking dependiente del perfil de palabras clave definido por el usuario para ordenar los candidatos. La novedad del enfoque se basa en 1) el uso del perfil de palabras clave y 2) las caracter&iacute;sticas gen&eacute;ricas derivadas de las categor&iacute;as de Wikipedia y no necesariamente relacionadas con el contenido del documento. El sistema se ha evaluado sobre conjuntos de datos de palabras clave y corpus de la campa&ntilde;a de evaluaci&oacute;n est&aacute;ndar y se ha demostrado que el sistema propuesto mejora el procedimiento global de extracci&oacute;n de palabras clave.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Palabras clave:</b> Extracci&oacute;n autom&aacute;tica de palabras clave, conocimiento enciclop&eacute;dico.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v17n2/v17n2a9.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1. Breiman, L. (1996).</b> Bagging predictors. <i>Machine Learning,</i> 24(2), 123&#45;140.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061085&pid=S1405-5546201300020000900001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>2. Charton, E., Camelin, N., Acuna&#45;Agost, R., Gotab, P., Lavalley, R., Kessler, R., &amp; Fernandez, S. (2008).</b> Pr&eacute;&#45;traitements classiques ou par analyse distributionnelle: application aux m&eacute;thodes de classification automatique d&eacute;ploy&eacute;es pour deft08. In <i>Actes DEFT08&#45;TALN'08.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061087&pid=S1405-5546201300020000900002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>3. Chen, P.&#45;I. &amp; Lin, S.&#45;J. (2010).</b> Automatic keyword prediction using google similarity distance. <i>Expert Systems with Applications,</i> 37.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061089&pid=S1405-5546201300020000900003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>4. Eichler, K. &amp; Neumann, G. (2010).</b> Dfki keywe: Ranking keyphrases extracted from scientific articles. In <i>Proceedings of the 5th International Workshop on Semantic Evaluation.</i> ACL.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061091&pid=S1405-5546201300020000900004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>5. Grineva, M., Grinev, M., &amp; Lizorkin, D. (2009).</b> Extracting key terms from noisy and multitheme documents. In <i>Proceedings of WWW '09.</i> ACM.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061093&pid=S1405-5546201300020000900005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6. Hammouda, K. M., Matute, D. N., &amp; Kamel,</b> <b>M. S. (2005).</b> Corephrase: keyphrase extraction for document clustering. In <i>Proceedings of MLDM'05.</i> Springer&#45;Verlag.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061095&pid=S1405-5546201300020000900006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>7. Hulth, A. (2003).</b> Improved automatic keyword extraction given more linguistic knowledge. In <i>Proceedings of EMNLP '03.</i> ACL.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061097&pid=S1405-5546201300020000900007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>8. Kim, S. N., Medelyan, O., Kan, M.&#45;Y., &amp; Baldwin, T. (2010).</b> Semeval&#45;2010 task 5: Automatic keyphrase extraction from scientific articles. In <i>Proceedings of the 5th International Workshop on Semantic Evaluation.</i> ACL, Sweden.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061099&pid=S1405-5546201300020000900008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>9. Lopez, P. &amp; Romary, L. (2010).</b> Humb: Automatic key term extraction from scientific articles in grobid. In <i>Proceedings of the 5th International Workshop on Semantic Evaluation.</i> ACL, Sweden.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061101&pid=S1405-5546201300020000900009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10. Matsuo, Y. &amp; Ishizuka, M. (2004).</b> Keyword extraction from a single document using word co&#45;occurrence statistical information. <i>International Journal on Artificial Intelligence Tools,</i> 13, 157&#45;169.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061103&pid=S1405-5546201300020000900010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>11. Medelyan, O., Frank, E., &amp; Witten, I. H.</b> <b>(2009).</b> Human&#45;competitive tagging using automatic keyphrase extraction. In <i>Proceedings of EMNLP '09.</i> ACL.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061105&pid=S1405-5546201300020000900011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>12. Medelyan, O., Witten, I. H., &amp; Milne, D. (2008).</b> Topic indexing with Wikipedia. In <i>Proceedings of the Wikipedia and AI workshop at AAAI&#45;08.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061107&pid=S1405-5546201300020000900012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>13. Rao, W., Chen, L., Hui, P., &amp; Tarkoma, S. (2012).</b> Move: A large scale keyword&#45;based content filtering and dissemination system. In <i>IEEE 32nd ICDS.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061109&pid=S1405-5546201300020000900013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>14. Sculley, D. (2010).</b> Combined regression and ranking. In <i>Proceedings of KDD '10.</i> ACM.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061111&pid=S1405-5546201300020000900014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>15. Stuart, R., Dave, E., Nick, C., &amp; Wendy, C. (2010).</b> <i>Text Mining,</i> chapter Automatic Keyword Extraction from Individual Documents. John Wiley &amp; Sons, Ltd, 1&#45;20.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061113&pid=S1405-5546201300020000900015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>16. Turney, P. D. (2000).</b> Learning algorithms for keyphrase extraction. <i>Inf. Retr.,</i> 2(4).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061115&pid=S1405-5546201300020000900016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>17. Vidal, M., Menezes, G. V., Berlt, K., de Moura, E. S., Okada, K., Ziviani, N., Fernandes, D., &amp; Cristo, M. (2012).</b> Selecting keywords to represent web pages using wikipedia information. In <i>Proceedings of WebMedia 12.</i> ACM, USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061117&pid=S1405-5546201300020000900017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>18. Witten, I. H., Paynter, G. W., Frank, E., Gutwin, C., &amp; Nevill&#45;Manning, C. G. (1999).</b> Kea: practical automatic keyphrase extraction. In <i>Proceedings of</i> <i>DL 99.</i> ACM, USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061119&pid=S1405-5546201300020000900018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>19. Yang, S., Jin, J., Parag, J., &amp; Liu, S. (2010).</b> Contextual advertising for web article printing. In <i>Proceedings of Doc Eng 10.</i> ACM, USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061121&pid=S1405-5546201300020000900019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>20. Yih, W.&#45;T., Goodman, J., &amp; Carvalho, V. R. (2006).</b> Finding advertising keywords on web pages. In <i>Proceedings of WWW '06.</i> ACM, USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061123&pid=S1405-5546201300020000900020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>21. Zhang, C., Wang, H., Liu, Y., Wu, D., Liao, Y., &amp;</b> <b>Wang, B. (2008).</b> Automatic keyword extraction from documents using conditional random fields. <i>Journal of Computational Information Systems,</i> 1169&#45;1180.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2061125&pid=S1405-5546201300020000900021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Breiman]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Bagging predictors]]></article-title>
<source><![CDATA[Machine Learning]]></source>
<year>1996</year>
<volume>24</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>123-140</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Charton]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Camelin]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Acuna-Agost]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Gotab]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Lavalley]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Kessler]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Fernandez]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Pré-traitements classiques ou par analyse distributionnelle: application aux méthodes de classification automatique déployées pour deft08]]></article-title>
<source><![CDATA[Actes DEFT08-TALN'08]]></source>
<year>2008</year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[P.-I.]]></given-names>
</name>
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[S.-J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic keyword prediction using google similarity distance]]></article-title>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>2010</year>
<page-range>37</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Eichler]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Neumann]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Dfki keywe: Ranking keyphrases extracted from scientific articles]]></article-title>
<source><![CDATA[Proceedings of the 5th International Workshop on Semantic Evaluation]]></source>
<year>2010</year>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Grineva]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Grinev]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Lizorkin]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Extracting key terms from noisy and multitheme documents]]></article-title>
<source><![CDATA[Proceedings of WWW '09]]></source>
<year>2009</year>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hammouda]]></surname>
<given-names><![CDATA[K. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Matute]]></surname>
<given-names><![CDATA[D. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Kamel]]></surname>
<given-names><![CDATA[M. S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Corephrase: keyphrase extraction for document clustering]]></article-title>
<source><![CDATA[Proceedings of MLDM'05]]></source>
<year>2005</year>
<publisher-name><![CDATA[Springer-Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hulth]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Improved automatic keyword extraction given more linguistic knowledge]]></article-title>
<source><![CDATA[Proceedings of EMNLP '03]]></source>
<year>2003</year>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[S. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Medelyan]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Kan]]></surname>
<given-names><![CDATA[M.-Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Baldwin]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Semeval-2010 task 5: Automatic keyphrase extraction from scientific articles]]></article-title>
<source><![CDATA[Proceedings of the 5th International Workshop on Semantic Evaluation]]></source>
<year>2010</year>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lopez]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Romary]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Humb: Automatic key term extraction from scientific articles in grobid]]></article-title>
<source><![CDATA[Proceedings of the 5th International Workshop on Semantic Evaluation]]></source>
<year>2010</year>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Matsuo]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Ishizuka]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Keyword extraction from a single document using word co-occurrence statistical information]]></article-title>
<source><![CDATA[International Journal on Artificial Intelligence Tools]]></source>
<year>2004</year>
<edition>13</edition>
<page-range>157-169</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Medelyan]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Frank]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Witten]]></surname>
<given-names><![CDATA[I. H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Human-competitive tagging using automatic keyphrase extraction]]></article-title>
<source><![CDATA[Proceedings of EMNLP '09]]></source>
<year>2009</year>
<publisher-name><![CDATA[ACL]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Medelyan]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Witten]]></surname>
<given-names><![CDATA[I. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Milne]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Topic indexing with Wikipedia]]></article-title>
<source><![CDATA[Proceedings of the Wikipedia and AI workshop at AAAI-08]]></source>
<year>2008</year>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Hui]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Tarkoma]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Move: A large scale keyword-based content filtering and dissemination system]]></article-title>
<source><![CDATA[IEEE 32nd ICDS]]></source>
<year>2012</year>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sculley]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Combined regression and ranking]]></article-title>
<source><![CDATA[Proceedings of KDD '10. ACM]]></source>
<year>2010</year>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Stuart]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Dave]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Nick]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wendy]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Text Mining, chapter Automatic Keyword Extraction from Individual Documents]]></source>
<year>2010</year>
<page-range>1-20</page-range><publisher-name><![CDATA[John Wiley & Sons, Ltd]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Turney]]></surname>
<given-names><![CDATA[P. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Learning algorithms for keyphrase extraction]]></article-title>
<source><![CDATA[Inf. Retr.]]></source>
<year>2000</year>
<volume>2</volume>
<numero>4</numero>
<issue>4</issue>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vidal]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Menezes]]></surname>
<given-names><![CDATA[G. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Berlt]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[de Moura]]></surname>
<given-names><![CDATA[E. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Okada]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Ziviani]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Fernandes]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Cristo]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Selecting keywords to represent web pages using wikipedia information]]></article-title>
<source><![CDATA[Proceedings of WebMedia 12]]></source>
<year>2012</year>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Witten]]></surname>
<given-names><![CDATA[I. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Paynter]]></surname>
<given-names><![CDATA[G. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Frank]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Gutwin]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Nevill-Manning]]></surname>
<given-names><![CDATA[C. G.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Kea: practical automatic keyphrase extraction]]></article-title>
<source><![CDATA[Proceedings of DL 99]]></source>
<year>1999</year>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Jin]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Parag]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Contextual advertising for web article printing]]></article-title>
<source><![CDATA[Proceedings of Doc Eng 10]]></source>
<year>2010</year>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yih]]></surname>
<given-names><![CDATA[W.-T.]]></given-names>
</name>
<name>
<surname><![CDATA[Goodman]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Carvalho]]></surname>
<given-names><![CDATA[V. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Finding advertising keywords on web pages]]></article-title>
<source><![CDATA[Proceedings of WWW '06]]></source>
<year>2006</year>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Wu]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Liao]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automatic keyword extraction from documents using conditional random fields]]></article-title>
<source><![CDATA[Journal of Computational Information Systems]]></source>
<year>2008</year>
<page-range>1169-1180</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
