SciELO - Scientific Electronic Library Online

 
vol.24 número3Neuro-Fuzzy Cognitive Temporal Models for Predicting Multidimensional Time Series with Fuzzy TrendsDigital Image Steganography Scheme for Speech Signals in the Discrete Wavelet Transform Domain índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

SILVA, Samuel Caetano da; FERREIRA, Thiago Castro; RAMOS, Ricelli Moreira Silva  e  PARABONI, Ivandré. Data-Driven and Psycholinguistics-Motivated Approaches to Hate Speech Detection. Comp. y Sist. [online]. 2020, vol.24, n.3, pp.1179-1188.  Epub 09-Jun-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-24-3-3478.

Computational models of hate speech detection and related tasks (e.g., detecting misogyny, racism, xenophobia, homophobia etc.) have emerged as major Natural Language Processing (NLP) research topics in recent years. In the present work, we investigate a range of alternative implementations of three of these tasks - namely, hate speech, aggressive behavior and target group recognition -by presenting a number of experiments involving different learning methods, including regularized logistic regression, convolutional neural networks (CNN) and deep bidirectional transformers (BERT), and using word embeddings, word n-grams, character n-grams and psycholinguistics-motivated (LIWC) features alike. Results suggest that a purely data-driven BERT model, and to some extent also a hybrid psycholinguisticly informed CNN model, generally outperform the alternatives under consideration for all tasks in both English and Spanish languages.

Palavras-chave : Natural language processing; hate speech; aggressive language detection.

        · texto em Inglês     · Inglês ( pdf )