SciELO - Scientific Electronic Library Online

 
Polibits

Online version ISSN 1870-9044

Abstract

CHAKRABORTY, Tanmoy and BANDYOPADHYAY, Sivaji. Inference of Fine-grained Attributes of Bengali Corpus for Stylometry Detection. Polibits [online]. 2011, n.44, pp. 79-83. ISSN 1870-9044.

Stylometry, the science of inferring characteristics of an author from the characteristics of documents written by that author, is a problem with a long history. It is a core task of text categorization, with applications in authorship identification, plagiarism detection, forensic investigation, computer security, and copyright and estate disputes. In this work, we present a strategy for stylometry detection of documents written in Bengali. We adopt a set of fine-grained attribute features together with a set of lexical markers for analyzing the text, and we use three semi-supervised measures for making decisions. A majority voting approach is then used for the final classification. The system is fully automatic and language-independent. Evaluation results for stylometry detection of Bengali authors show reasonably promising accuracy in comparison to the baseline model.

Keywords: Stylometry; stylistic markers; cosine-similarity; chi-square measure; Euclidean distance.
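
The abstract does not give the feature set or the exact decision rules, so the following is only a minimal sketch of how the three measures named in the keywords might be combined by majority voting. It assumes each document and each author profile is already a non-negative numeric feature vector; the function names, the form of the chi-square distance, and the tie-breaking rule are all illustrative assumptions, not the authors' method.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (higher = more similar)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def chi_square_distance(a, b):
    """One common chi-square distance (lower = more similar).

    Assumes non-negative features; components where both values are 0 are skipped.
    """
    return sum((x - y) ** 2 / (x + y) for x, y in zip(a, b) if x + y > 0)


def euclidean_distance(a, b):
    """Straight-line distance between two feature vectors (lower = more similar)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(doc, profiles):
    """Let each measure vote for one author, then return the majority choice.

    `profiles` maps a (hypothetical) author name to that author's feature vector.
    """
    votes = [
        max(profiles, key=lambda name: cosine_similarity(doc, profiles[name])),
        min(profiles, key=lambda name: chi_square_distance(doc, profiles[name])),
        min(profiles, key=lambda name: euclidean_distance(doc, profiles[name])),
    ]
    winner, count = Counter(votes).most_common(1)[0]
    # Assumption: if all three measures disagree, fall back to the cosine vote.
    return winner if count >= 2 else votes[0]
```

With two hypothetical profiles such as `{"author_a": [0.5, 0.3, 0.2], "author_b": [0.1, 0.2, 0.7]}`, calling `classify([0.45, 0.35, 0.2], profiles)` returns the author whose profile at least two of the three measures consider closest.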
