Computación y Sistemas
On-line version ISSN 2007-9737
Print version ISSN 1405-5546
Abstract
GELBUKH, Alexander; SIDOROV, Grigori and GUZMAN-ARENAS, Adolfo. Document Indexing with a Concept Hierarchy. Comp. y Sist. [online]. 2005, vol.8, n.4, pp.281-292. ISSN 2007-9737.
Given a large hierarchical concept dictionary (a thesaurus or ontology), the task of selecting the concepts that describe the contents of a given document is considered. A statistical method of document indexing driven by such a dictionary is proposed. The method is insensitive to inaccuracies in the dictionary, which allows for semi-automatic translation of the hierarchy into different languages. The problem of handling non-terminal and especially top-level nodes in the hierarchy is discussed. Common-sense-compliant methods for automatically assigning weights to the nodes and links in the hierarchy are presented. The application of the method in the Classifier system is discussed.
Keywords: Document Characterization; Document Comparison; Ontology; Statistical Methods.
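
The abstract does not give the details of the method, so the following is only a minimal illustrative sketch of the general idea of dictionary-driven statistical indexing, not the authors' algorithm. It assumes a toy hierarchy in which keywords are attached to terminal concepts and keyword counts are propagated upward through weighted links. The names `PARENT`, `KEYWORDS`, and `index_document`, the concepts, and all weights are hypothetical.

```python
from collections import Counter
import re

# Hypothetical toy hierarchy: child concept -> (parent concept, link weight).
# Links into the top-level node "ANY" get a lower weight (an assumption),
# so that the root does not dominate every document.
PARENT = {
    "football": ("sports", 1.0),
    "tennis": ("sports", 1.0),
    "sports": ("ANY", 0.5),
    "elections": ("politics", 1.0),
    "politics": ("ANY", 0.5),
}

# Hypothetical keyword lists (keyword -> weight) for terminal concepts.
KEYWORDS = {
    "football": {"goal": 1.0, "striker": 1.0},
    "tennis": {"serve": 1.0, "racket": 1.0},
    "elections": {"ballot": 1.0, "vote": 1.0},
}


def index_document(text):
    """Return a Counter mapping each concept to its accumulated score."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    scores = Counter()
    for concept, keywords in KEYWORDS.items():
        # Weighted frequency of this concept's keywords in the document.
        score = sum(words[w] * weight for w, weight in keywords.items())
        node = concept
        # Propagate the score up the hierarchy, scaling by each link weight.
        while score > 0:
            scores[node] += score
            if node not in PARENT:
                break
            node, link_weight = PARENT[node]
            score *= link_weight
    return scores


scores = index_document(
    "The striker scored a goal, then another goal. One vote was cast."
)
for concept, score in scores.most_common():
    print(concept, score)
```

In this sketch a non-terminal node such as "sports" receives a score only through its descendants, and the reduced weight on links into "ANY" is one simple way to keep a top-level node from outranking more specific concepts.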