<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462005000200003</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Document Indexing with a Concept Hierarchy]]></article-title>
<article-title xml:lang="es"><![CDATA[Índice de Documentos con una Jerarquía de Conceptos]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[Alexander]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[Grigori]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Guzmán-Arenas]]></surname>
<given-names><![CDATA[Adolfo]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,National Polytechnic Institute (IPN) Center for Computing Research (CIC) ]]></institution>
<addr-line><![CDATA[DF México]]></addr-line>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2005</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2005</year>
</pub-date>
<volume>8</volume>
<numero>4</numero>
<fpage>281</fpage>
<lpage>292</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462005000200003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462005000200003&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462005000200003&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Given a large hierarchical concept dictionary (thesaurus, or ontology), the task of selection of the concepts that describe the contents of a given document is considered. A statistical method of document indexing driven by such a dictionary is proposed. The method is insensible to inaccuracies in the dictionary, which allow for semi-automatic translation of the hierarchy into difíerent languages. The problem of handling non-terminal and especially top-level nodes in the hierarchy is discussed. Common sense-complaint methods of automatically assigning the weights to the nodes and links in the hierarchyare presented. The application of the method in the Classifier system is discussed.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[Se considera la tarea de la selección de los conceptos que describen el contenido de un documento dado. Los conceptos se eligen de un diccionario. jerárquico grande (un tesauro, o bien una ontología). Se propone un método estadístico para crear un índice de los documentos, guiado por tal diccionario. El método es robusto en cuanto a los errores en el diccionario, lo que permite traducir tal diccionario semiautomáticamente en varios lenguajes. Se discute el problema del uso de los nodos no terminales y especialmente de los nodos de alto nivel en la jerarquía. Se presentan los métodos para ponderación automática de los nodos y vínculos en la jerarquía de la manera en que coincide con los criterios del sentido común. Se discute la aplicación del método en el sistema Classifier.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Document Characterization]]></kwd>
<kwd lng="en"><![CDATA[Document Comparison]]></kwd>
<kwd lng="en"><![CDATA[Ontology]]></kwd>
<kwd lng="en"><![CDATA[Statistical Methods]]></kwd>
<kwd lng="es"><![CDATA[Caracterización de Documentos]]></kwd>
<kwd lng="es"><![CDATA[Comparación de Documentos]]></kwd>
<kwd lng="es"><![CDATA[Ontología]]></kwd>
<kwd lng="es"><![CDATA[Métodos Estadísticos]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>Document Indexing with a Concept Hierarchy</b></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font face="verdana" size="4"><i>&Iacute;ndice de Documentos con una Jerarqu&iacute;a de Conceptos</i></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font face="verdana" size="2"><b>Alexander Gelbukh, Grigori Sidorov and Adolfo Guzm&aacute;n&#150;Arenas</b></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font face="verdana" size="2"><i>Natural Language Processing Laboratory,    <br>   Center for Computing Research (CIC), National Polytechnic Institute (IPN),    ]]></body>
<body><![CDATA[<br> Av. Juan de Dios B&aacute;tiz s/n, Esq. Mendiz&aacute;bal, Col. Zacatenco, CP 07738, DF, M&eacute;xico.</i></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font face="verdana" size="2"><b>E&#150;mail:</b> <a href="mailto:gelbukh@gelbukh.com">gelbukh@gelbukh.com</a>, <a href="mailto:gelbukh@gelbukh.com">sidorov@cic.ipn.mx</a>, <a href="mailto:a.guzman@acm.org">a.guzman@acm.org</a></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font size="2" face="verdana"><a href="http://www.gelbukh.com/" target="_blank">www.Gelbukh.com</a></font></p>     <p align="center">&nbsp;</p>     <p align="center"><font size="2" face="verdana"><u>Article received on april13, 2004; accepted on march 15, 2005</u></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">Given a large hierarchical concept dictionary (thesaurus, or ontology), the task of selection of the concepts that describe the contents of a given document is considered. A statistical method of document indexing driven by such a dictionary is proposed. The method is insensible to inaccuracies in the dictionary, which allow for semi&#150;automatic translation of the hierarchy into dif&iacute;erent languages. The problem of handling non&#150;terminal and especially top&#150;level nodes in the hierarchy is discussed. Common sense&#150;complaint methods of automatically assigning the weights to the nodes and links in the hierarchyare presented. The application of the method in the Classifier system is discussed.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Document Characterization, Document Comparison, Ontology, Statistical Methods.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>     <p align="justify"><font face="verdana" size="2">Se considera la tarea de la selecci&oacute;n de los conceptos que describen el contenido de un documento dado. Los conceptos se eligen de un diccionario. jer&aacute;rquico grande (un tesauro, o bien una ontolog&iacute;a). Se propone un m&eacute;todo estad&iacute;stico para crear un &iacute;ndice de los documentos, guiado por tal diccionario. El m&eacute;todo es robusto en cuanto a los errores en el diccionario, lo que permite traducir tal diccionario semiautom&aacute;ticamente en varios lenguajes. Se discute el problema del uso de los nodos no terminales y especialmente de los nodos de alto nivel en la jerarqu&iacute;a. Se presentan los m&eacute;todos para ponderaci&oacute;n autom&aacute;tica de los nodos y v&iacute;nculos en la jerarqu&iacute;a de la manera en que coincide con los criterios del sentido com&uacute;n. Se discute la aplicaci&oacute;n del m&eacute;todo en el sistema <i>Classifier</i>.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Palabras Clave:</b> Caracterizaci&oacute;n de Documentos, Comparaci&oacute;n de Documentos. Ontolog&iacute;a, M&eacute;todos Estad&iacute;sticos.</font></p>     <p align="justify">&nbsp;</p>     <p align="justify"><font size="2" face="verdana"><a href="/pdf/cys/v8n4/v8n4a3.pdf" target="_blank">DESCARGAR ARTICULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Acknowledgments</b></font></p>     <p align="justify"><font face="verdana" size="2">The work was partially supported by Mexican Government (SNI, CONACyT, CGPI&#150;IPN).</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">1. <b>Apt&eacute; Ch; F. Damerau, and Sh. M. Weiss</b>, "Automated learning of decision rules for text categorization". <i>ACM Transactions on Information Systems.</i> Vol. 12, No. 3 (July 1994), pp. 233&#150;251.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049069&pid=S1405-5546200500020000300001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">2. <b>Bharat K. and M. Henzinger</b>, "Improved algorithms for topic distillation in hyper&#150;linked environments", <i>21<sup>st</sup> International ACM SIGIR Conference</i>, 1998.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049070&pid=S1405-5546200500020000300002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">3. <b>Cassidy P.</b>, "An Investigation of the Semantic Relations in the Roget's Thesaurus: Preliminary results", In: <i>Proc. ClCLing&#150;2000, International Conference on Intelligent Text Processing and Computational Linguistics</i>, IPN, Mexico, 2000, 181&#150;204.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049071&pid=S1405-5546200500020000300003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">4. <b>Chakrabarti S.; B. Dom, R. Agrawal, and P. Raghavan</b> "Using taxonomy, discriminants, and signatures for navigating in text databases",<i> 23<sup>rd</sup> VLDB Conference,</i> Athenas, Greece, 1997.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049072&pid=S1405-5546200500020000300004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">5. <b>Cohen W. and Y. Singer,</b> "Context&#150;sensitive Learning Methods for Text Categorization", <i>Proc. of SIGIR'96,</i> 1996.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049073&pid=S1405-5546200500020000300005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">6. <b>Feldman R. and I. Dagan</b>, <i>"Knowledge Discovery in Textual Databases"</i>, Knowledge Discovery and Data Mining, Montreal, Canada, 1995.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049074&pid=S1405-5546200500020000300006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">7. <b>Gelbukh A.</b>, "Using a semantic network for lexical and syntactic disambiguation", <i>Proc. of Simposium Internacional de Computaci&oacute;n: Nuevas Aplicaciones e Innovaciones Tecnol&oacute;gicas en Computaci&oacute;n,</i> November 1997, Mexico.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049075&pid=S1405-5546200500020000300007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">8. <b>Gelbukh A.</b>, "Syntactic disambiguation with weighted extended subcategorization frames". <i>Proc. PACLlNG&#150;99, Pacific Association for Computational Linguistics,</i> Canada, pp. 244&#150;249.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049076&pid=S1405-5546200500020000300008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">9. <b>Gelbukh A., G. Sidorov, and A. Guzm&aacute;n&#150;Arenas</b>, "Document comparison with a weighted topic hierarchy", <i>Proc. 1<sup>st </sup>International Workshop on Document Analysis and Understanding for Document Databases (DAUDD'99), 10<sup>th</sup> International Conference and Workshop on Database and Expert Systems Applications (DEXA),</i> Florence, Italy, September 1, 1999. IEEE Computer Society Press, pp. 566&#150;570.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049077&pid=S1405-5546200500020000300009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">10. <b>Gelbukh A., G. Sidorov, and A. Guzm&aacute;n&#150;Arenas,</b> "A Method of Describing Document Contents through Topic Selection". <i>Proc. of SPIRE'99,Internalional Symposium on String Processing and Information Retrieval</i>, Cancun, Mexico, September 22&#150;24. IEEE Computer Society Press, 1999, pp. 73&#150;80.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049078&pid=S1405-5546200500020000300010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">11. <b>Guzm&aacute;n&#150;Arenas A.</b>, "Finding the main themes in a Spanish document", <i>Expert Systems with Applications,</i> Vol. 14, No. 1/2, Jan/Feb 1998, pp. 139&#150;148.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049079&pid=S1405-5546200500020000300011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">12. <b>Guzm&aacute;n&#150;Arenas A.,</b> "Hallando los temas principales en un art&iacute;culo en espa&ntilde;ol," <i>Soluciones Avanzadas.</i> 1997, Vol. 5, , No. 45, p. 58, No. 49, p. 66.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049080&pid=S1405-5546200500020000300012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">13. <b>Hy&ouml;tyniemi H.</b>, "Text Document Classification with Self&#150;Organizing Maps", in STeP'96, <i>Genes, Nets and Symbols</i>, Alander, J.; Honkela, T.; Jakobsson, M. (eds.), Finnish Artificial Intelligence Society, 1996, pp. 64&#150;72. </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049081&pid=S1405-5546200500020000300013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">14. <b>Koller D. and M. Sahami,</b> "Hierarchically classifying documents using very few words", <i>International Conference on Machine Learning,</i> 1997, pp. 170&#150;178.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049082&pid=S1405-5546200500020000300014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">15. <b>Krowetz B.</b> "Homonymy and Polysemy in Information Retrieval", <i>35th Annual Meeting of the Association for Computational Linguistics,</i> 1997, pp. 72&#150;79</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049083&pid=S1405-5546200500020000300015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">16. <b>Le D.X., G. Thoma and H. Weschler,</b> "Document Classification using Connectionist Models",<i> IEEE International Conference on Neural Networks,</i> Orlando, FL, June 28 &#150; July 2, 1994, Vol. 5, pp. 3009&#150;3014.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049084&pid=S1405-5546200500020000300016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">17. <b>Light J.,</b> "A distributed, graphical, topic&#150;oriented document search system" <i>CIKM '97, Proceedings of the sixth international conference on Information and knowledge management,</i> 1997, pp. 285&#150;292.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049085&pid=S1405-5546200500020000300017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">18. <b>Niwa Y., Sh. Nishioka, M. Iwayama, A. Takano, and Y. Nitta,</b> "Topie Graph Generation for Query Navigation: Use of Frequeney Classes for Topie Extraetion", NLPRS'97, <i>Natural Language Processing Pacific Rim Symposium '97,</i> Phuket, Thailand, Dee. 1997, pp. 95&#150;100.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049086&pid=S1405-5546200500020000300018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">19. <b>Ponte J. M. and W. B. Croft,</b> "Text Segmentation by Topic", <i>First European Conference on Research and Advanced Technology for Digital Libraries,</i> 1997, pp. 113&#150;125.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049087&pid=S1405-5546200500020000300019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">20. <b>Seymore K. and R. Rosenfeld,</b> "Using story topics for language model adaptation", <i>Proc. 01 Eurospeech '97,</i> 1997.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049088&pid=S1405-5546200500020000300020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">21. <b>WORDNET</b>, <i>Coling&#150;ACL'98 Workshop: Usage of WordNet in Natural Language Processing Systems</i>. August 16, 1998, Universit&eacute; de Montr&eacute;al, Montr&eacute;al, Canada.</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2049089&pid=S1405-5546200500020000300021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Apté]]></surname>
<given-names><![CDATA[Ch]]></given-names>
</name>
<name>
<surname><![CDATA[Damerau]]></surname>
<given-names><![CDATA[F]]></given-names>
</name>
<name>
<surname><![CDATA[Weiss]]></surname>
<given-names><![CDATA[Sh. M]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Automated learning of decision rules for text categorization]]></article-title>
<source><![CDATA[ACM Transactions on Information Systems]]></source>
<year>July</year>
<month> 1</month>
<day>99</day>
<volume>12</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>233-251</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bharat]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Henzinger]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Improved algorithms for topic distillation in hyper-linked environments]]></source>
<year>1998</year>
<conf-name><![CDATA[ 21st International ACM SIGIR Conference]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cassidy]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An Investigation of the Semantic Relations in the Roget's Thesaurus: Preliminary results]]></article-title>
<source><![CDATA[Proc. ClCLing-2000]]></source>
<year>2000</year>
<conf-name><![CDATA[ International Conference on Intelligent Text Processing and Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>181-204</page-range><publisher-name><![CDATA[IPN]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chakrabarti]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Dom]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Agrawal]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Raghavan]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
</person-group>
<source><![CDATA[Using taxonomy, discriminants, and signatures for navigating in text databases]]></source>
<year>1997</year>
<conf-name><![CDATA[ 23rd VLDB Conference]]></conf-name>
<conf-loc> </conf-loc>
<publisher-loc><![CDATA[Athenas^eGreece Greece]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cohen]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
<name>
<surname><![CDATA[Singer]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Context-sensitive Learning Methods for Text Categorization: Proc. of SIGIR'96]]></article-title>
<source><![CDATA[]]></source>
<year>1996</year>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Feldman]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Dagan]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
</person-group>
<source><![CDATA[Knowledge Discovery in Textual Databases: Knowledge Discovery and Data Mining]]></source>
<year>1995</year>
<publisher-loc><![CDATA[Montreal^eCanada Canada]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Using a semantic network for lexical and syntactic disambiguation]]></source>
<year></year>
<conf-name><![CDATA[ Proc. of Simposium Internacional de Computación: Nuevas Aplicaciones e Innovaciones Tecnológicas en Computación]]></conf-name>
<conf-date>November 1997</conf-date>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Syntactic disambiguation with weighted extended subcategorization frames: Proc. PACLlNG-99, Pacific Association for Computational Linguistics]]></source>
<year></year>
<page-range>244-249</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Guzmán-Arenas]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Document comparison with a weighted topic hierarchy]]></source>
<year>Sept</year>
<month>em</month>
<day>be</day>
<conf-name><![CDATA[ Proc. 1st International Workshop on Document Analysis and Understanding for Document Databases (DAUDD'99), 10th International Conference and Workshop on Database and Expert Systems Applications (DEXA)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>566-570</page-range><publisher-loc><![CDATA[Florence^eItaly Italy]]></publisher-loc>
<publisher-name><![CDATA[IEEE Computer Society Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Guzmán-Arenas]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[A Method of Describing Document Contents through Topic Selection]]></source>
<year>1999</year>
<conf-name><![CDATA[ Proc. of SPIRE'99,Internalional Symposium on String Processing and Information Retrieval]]></conf-name>
<conf-loc> </conf-loc>
<page-range>73-80</page-range><publisher-loc><![CDATA[Cancun^eMexico Mexico]]></publisher-loc>
<publisher-name><![CDATA[IEEE Computer Society Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Guzmán-Arenas]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Finding the main themes in a Spanish document]]></article-title>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>1998</year>
<volume>14</volume>
<numero>1/2</numero>
<issue>1/2</issue>
<page-range>139-148</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Guzmán-Arenas]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="es"><![CDATA[Hallando los temas principales en un artículo en español]]></article-title>
<source><![CDATA[Soluciones Avanzadas]]></source>
<year>1997</year>
<volume>5</volume>
<numero>45</numero><numero>49</numero>
<issue>45</issue><issue>49</issue>
<page-range>58</page-range><page-range>66</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hyötyniemi]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text Document Classification with Self-Organizing Maps]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Alander]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Honkela]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Jakobsson]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[STeP'96, Genes, Nets and Symbols]]></source>
<year>1996</year>
<page-range>64-72</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koller]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Sahami]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<source><![CDATA[Hierarchically classifying documents using very few words]]></source>
<year>1997</year>
<conf-name><![CDATA[ International Conference on Machine Learning]]></conf-name>
<conf-loc> </conf-loc>
<page-range>170-178</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Krowetz]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<source><![CDATA[Homonymy and Polysemy in Information Retrieval]]></source>
<year>1997</year>
<conf-name><![CDATA[ 35th Annual Meeting of the Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>72-79</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Le]]></surname>
<given-names><![CDATA[D.X]]></given-names>
</name>
<name>
<surname><![CDATA[Thoma]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
<name>
<surname><![CDATA[Weschler]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[Document Classification using Connectionist Models]]></source>
<year>1994</year>
<volume>5</volume>
<conf-name><![CDATA[ IEEE International Conference on Neural Networks]]></conf-name>
<conf-loc> </conf-loc>
<page-range>3009-3014</page-range><publisher-loc><![CDATA[Orlando^eFL FL]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Light]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<source><![CDATA[A distributed, graphical, topic-oriented document search system" CIKM '97]]></source>
<year>1997</year>
<conf-name><![CDATA[ Proceedings of the sixth international conference on Information and knowledge management]]></conf-name>
<conf-loc> </conf-loc>
<page-range>285-292</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Niwa]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
<name>
<surname><![CDATA[Nishioka]]></surname>
<given-names><![CDATA[Sh]]></given-names>
</name>
<name>
<surname><![CDATA[Iwayama]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Takano]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Nitta]]></surname>
<given-names><![CDATA[Y]]></given-names>
</name>
</person-group>
<source><![CDATA[Topie Graph Generation for Query Navigation: Use of Frequeney Classes for Topie Extraetion]]></source>
<year>1997</year>
<conf-name><![CDATA[ NLPRS'97, Natural Language Processing Pacific Rim Symposium '97]]></conf-name>
<conf-loc> </conf-loc>
<page-range>95-100</page-range><publisher-loc><![CDATA[Phuket^eThailand Thailand]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ponte]]></surname>
<given-names><![CDATA[J. M]]></given-names>
</name>
<name>
<surname><![CDATA[Croft]]></surname>
<given-names><![CDATA[W. B]]></given-names>
</name>
</person-group>
<source><![CDATA[Text Segmentation by Topic]]></source>
<year>1997</year>
<conf-name><![CDATA[ First European Conference on Research and Advanced Technology for Digital Libraries]]></conf-name>
<conf-loc> </conf-loc>
<page-range>113-125</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Seymore]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Rosenfeld]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<source><![CDATA[Using story topics for language model adaptation: Proc. 01 Eurospeech '97]]></source>
<year>1997</year>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="book">
<collab>WORDNET</collab>
<article-title xml:lang="en"><![CDATA[Coling-ACL'98 Workshop: Usage of WordNet]]></article-title>
<source><![CDATA[Natural Language Processing Systems]]></source>
<year>1998</year>
<publisher-loc><![CDATA[Montréal^eCanada Canada]]></publisher-loc>
<publisher-name><![CDATA[Université de Montréal]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
