SciELO - Scientific Electronic Library Online

 
vol.23 número3Joint Learning of Named Entity Recognition and Dependency Parsing using Separate DatasetsKeyVector: Unsupervised Keyphrase Extraction Using Weighted Topic via Semantic Relatedness índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

ZULKHAZHAV, Altanbek; KOZHIRBAYEV, Zhanibek; YESSENBAYEV, Zhandos  e  SHARIPBAY, Altynbek. Kazakh Text Summarization using Fuzzy Logic. Comp. y Sist. [online]. 2019, vol.23, n.3, pp.851-859.  Epub 09-Ago-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-23-3-3239.

In this paper we present an extractive summarization method for the Kazakh language based on fuzzy logic. We aimed to extract and concatenate important sentences from the primary text to obtain its shorter form. With the rapid growth of information on the Internet there is a demand on its efficient and cost-effective summarization. Therefore the creation of automatic summarization methods is considered as a very important task of natural language processing. Our approach is based on the preprocessing of the sentences by applying morphological analysis and pronoun resolution techniques in order to avoid their early rejections. Afterwards, we determine the features of the processed sentences need for exploiting fuzzy logic methods. Additionally, since there is no available data for the given task, we collected and manually annotated our own dataset from the different Internet resources in the Kazakh language for the experimentation. We also applied our method on CNN/Daily Mail dataset. The ROUGE-N indicators were calculated to assess the quality of the proposed method. The ROUGE-L(f-score) score by the proposed method with pronoun resolution for the former dataset is 0.40, whereas for the latter one it is 0.38.

Palavras-chave : Extractive text summarization; natural language processing; fuzzy logic.

        · texto em Inglês     · Inglês ( pdf )