SciELO - Scientific Electronic Library Online

 
vol.28 número2Optimizing the Performance of the IDS through Feature-Relevant Selection Using PSO and Random Forest TechniquesMulti-Class Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Approaches índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

CASTILLO, Esteban  y  CERVANTES, Ofelia. Mining a Trending Topic: U.S. Immigration on the Context of Social Media. Comp. y Sist. [online]. 2024, vol.28, n.2, pp.489-505.  Epub 31-Oct-2024. ISSN 2007-9737.  https://doi.org/10.13053/cys-28-2-4574.

This paper presents a text mining approach for extracting valuable patterns from social media documents in the context of U.S. immigration. The paper points out the uncovering of statistical features alongside linguistic elements based on graph techniques. The use of graphs provide rich data structures for representing lexical and syntactic aspects of texts, allowing the discovery of complex patterns that used by experts could provide valuable insight. The proposed method is applied over a Twitter-X/-Reddit dataset that comprise English and Spanish language samples from 2016 up to 2019. Experimental results showed that our interpretation of classic statistic techniques provide a baseline understanding of the topic while a more robust analysis (graphs) permits to uncover/predict hidden patterns over large amount of samples. In particular, the use of a co-occurrence graph helped to obtain relevant words, phrases and sentences while a user-interaction graph allow to detect important users, communities and interactions among themselves.

Palabras llave : Text mining; statistics; graph mining; social network analysis; natural language processing; big data.

        · texto en Inglés     · Inglés ( pdf )