SciELO - Scientific Electronic Library Online

 
vol.18 número3Inferring Relations and Annotations in Semantic Network: Application to RadiologyUsing Multi-View Learning to Improve Detection of Investor Sentiments on Twitter índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

LI, Huayi; LIU, Bing; MUKHERJEE, Arjun  e  SHAO, Jidong. Spotting Fake Reviews using Positive-Unlabeled Learning. Comp. y Sist. [online]. 2014, vol.18, n.3, pp.467-475. ISSN 2007-9737.  https://doi.org/10.13053/CyS-18-3-2035.

Fake review detection has been studied by researchers for several years. However, so far all reported studies are based on English reviews. This paper reports a study of detecting fake reviews in Chinese. Our review dataset is from the Chinese review hosting site Dianping, which has built a fake review detection system. They are confident that their algorithm has a very high precision, but they don't know the recall. This means that all fake reviews detected by the system are almost certainly fake but the remaining reviews may not be all genuine. This paper first reports a supervised learning study of two classes, fake and unknown. However, since the unknown set may contain many fake reviews, it is more appropriate to treat it as an unlabeled set. This calls for the model of learning from positive and unlabeled examples (or PU-learning). Experimental results show that PU learning not only outperforms supervised learning significantly, but also detects a large number of potentially fake reviews hidden in the unlabeled set that Dianping fails to detect.

Palavras-chave : Fake reviews; Positive-Unlabeled learning; PU-learning.

        · texto em Inglês     · Inglês ( pdf )

 

Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons