SciELO - Scientific Electronic Library Online

 
vol.23 issue3Joint Learning of Named Entity Recognition and Dependency Parsing using Separate DatasetsKeyVector: Unsupervised Keyphrase Extraction Using Weighted Topic via Semantic Relatedness author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

ZULKHAZHAV, Altanbek; KOZHIRBAYEV, Zhanibek; YESSENBAYEV, Zhandos  and  SHARIPBAY, Altynbek. Kazakh Text Summarization using Fuzzy Logic. Comp. y Sist. [online]. 2019, vol.23, n.3, pp.851-859.  Epub Aug 09, 2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-23-3-3239.

In this paper we present an extractive summarization method for the Kazakh language based on fuzzy logic. We aimed to extract and concatenate important sentences from the primary text to obtain its shorter form. With the rapid growth of information on the Internet there is a demand on its efficient and cost-effective summarization. Therefore the creation of automatic summarization methods is considered as a very important task of natural language processing. Our approach is based on the preprocessing of the sentences by applying morphological analysis and pronoun resolution techniques in order to avoid their early rejections. Afterwards, we determine the features of the processed sentences need for exploiting fuzzy logic methods. Additionally, since there is no available data for the given task, we collected and manually annotated our own dataset from the different Internet resources in the Kazakh language for the experimentation. We also applied our method on CNN/Daily Mail dataset. The ROUGE-N indicators were calculated to assess the quality of the proposed method. The ROUGE-L(f-score) score by the proposed method with pronoun resolution for the former dataset is 0.40, whereas for the latter one it is 0.38.

Keywords : Extractive text summarization; natural language processing; fuzzy logic.

        · text in English     · English ( pdf )