SciELO - Scientific Electronic Library Online

 
vol.22 número1New Similarity Function for Scientific Articles Clustering based on the Bibliographic ReferencesAutomatic Theorem Proving for Natural Logic: A Case Study on Textual Entailment índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

GELBUKH, Alexander. Inferences for Enrichment of Collocation Databases by Means of Semantic Relations. Comp. y Sist. [online]. 2018, vol.22, n.1, pp.103-117. ISSN 2007-9737.  https://doi.org/10.13053/cys-22-1-2923.

A text consists of words that are syntactically linked and semantically combinable—like “political party,” “pay attention,” or “stone cold.” Such semantically plausible combinations of two content words, which we hereafter refer to as collocations, are important knowledge in many areas of computational linguistics. We present the structure of a lexical resource that provides such knowledge—a collocation database (CBD). Since such databases cannot be complete under any reasonable compilation procedure, we consider heuristic-based inference mechanisms that predict new plausible collocations based on the ones present in the CDB, with the help of a WordNet-like thesaurus: if an available collocation combines the entries A and B, and B is ‘similar’ to C, then A and C are supposed to constitute a collocation of the same category. Also, we describe the semantically induced morphological categories suiting for such inference, as well as the heuristics for filtering out wrong hypotheses. We discuss the experience in inferences obtained with CrossLexica CDB.

Palavras-chave : Collocations; inference rules; enrichment; synonyms; hypernyms; meronyms.

        · texto em Inglês     · Inglês ( pdf )