SciELO - Scientific Electronic Library Online

 
vol.28 número3Respiratory Disease Pre-Diagnosis through a Novel Pattern Classification Algorithm based on Associative MemoriesCreation of a Corpus in Spanish for the Recognition of Personality Traits índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

RAHMAN-LASKAR, Sahinur; GUPTA, Gauri; BADHANI, Ritika  y  PINTO-AVENDANO, David Eduardo. Cyberbullying Detection in a Multi-classification Codemixed Dataset. Comp. y Sist. [online]. 2024, vol.28, n.3, pp.1091-1113.  Epub 21-Ene-2025. ISSN 2007-9737.  https://doi.org/10.13053/cys-28-3-4989.

In an era characterized by digital communication and social media, the concept of cyberbullying has arisen as a social concern, impacting individuals of all ages. It refers to the act of using digital communication tools like, social media, and messaging apps, to harass intimidate or harm someone. Codemixed cyberbullying refers to the use of multiple languages or a mix of languages in online communications and the use of multiple languages or a mix of languages can sometimes make it challenging for content moderators or automated systems to detect and address cyberbullying effectively. The challenges include the availability of standard codemixed datasets, especially for Indian languages. This paper investigates cyberbullying detection in Hinglish, a code-mixed language of Hindi and English. We have created a novel multi-class Hinglish dataset, annotated across seven cyberbullying categories: age, gender, religion, mockery, abusive, offensive, and not cyberbullying, and explored different machine learning-based models. We have performed a comparative analysis based on the standard evaluation metrics and achieved a state-of-the-art result on a multi-class codemixed Hinglish dataset.

Palabras llave : Cyberbullying; codemixed; Hinglish; machine learning.

        · texto en Inglés     · Inglés ( pdf )