<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1665-6423</journal-id>
<journal-title><![CDATA[Journal of applied research and technology]]></journal-title>
<abbrev-journal-title><![CDATA[J. appl. res. technol]]></abbrev-journal-title>
<issn>1665-6423</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional Autónoma de México, Instituto de Ciencias Aplicadas y Tecnología]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1665-64232024000400548</article-id>
<article-id pub-id-type="doi">10.22201/icat.24486736e.2024.22.4.2466</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Hate speech against women and immigrants: A comparative analysis of machine learning and text embedding techniques]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Hussain]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Aslam]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Chang&#8217;an University School of Electronics and Control Engineering ]]></institution>
<addr-line><![CDATA[Xi&#8217;an ]]></addr-line>
<country>China</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Chang&#8217;an University School of Information Engineering ]]></institution>
<addr-line><![CDATA[Xi&#8217;an ]]></addr-line>
<country>China</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>00</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>00</month>
<year>2024</year>
</pub-date>
<volume>22</volume>
<numero>4</numero>
<fpage>548</fpage>
<lpage>559</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1665-64232024000400548&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1665-64232024000400548&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1665-64232024000400548&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract Hate speech on social media, especially against women and immigrants, is a major issue. Twitter, which promotes public discourse and diverse viewpoints, explicitly rejects violence, discrimination, and assaults based on race, nationality, ethnicity, social status, sexual orientation, age, disability, or severe illness. Hate speech harms individuals and communities, but the volume of internet content makes routine detection impractical. This challenge highlights the need to address and develop effective hate speech detection and categorization systems for women and immigrants. This research describes the deployment of two advanced machine learning paradigms, the Random Forest, and Support Vector Machine (SVM), using text pre-processing, post-processing, and advanced text embedding techniques like TF-IDF, CBOW, and GloVE. Detailed categorization of a Twitter dataset into hate speech and subclassification into aggressive and targeted dimensions is the main goal. Model efficacy is carefully assessed based on the complex interaction of text embeddings and classification typology. The Random Forest classifier excels at hate speech categorization when combined with TF-IDF embeddings. Concurrently, merging GloVE embeddings with the SVM algorithm accurately discriminates between aggressive, non-aggressive, targeted, and non-targeted categories. Also, CBOW embeddings work well for broader hate speech classification. Thus, this work improves social media hate speech identification by providing theoretical and practical insights.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Sentiment analysis]]></kwd>
<kwd lng="en"><![CDATA[machine learning]]></kwd>
<kwd lng="en"><![CDATA[Twitter sentiment]]></kwd>
<kwd lng="en"><![CDATA[text embedding]]></kwd>
<kwd lng="en"><![CDATA[speech analysis]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aslam]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Hussain]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A Performance Analysis of Machine Learning Techniques for Credit Card Fraud Detection]]></article-title>
<source><![CDATA[Journal of Artificial Intelligence]]></source>
<year>2024</year>
<volume>6</volume>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[K. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Garai]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Das]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Patra]]></surname>
<given-names><![CDATA[B. G.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Profiling Hate Speech Spreaders on Twitter]]></article-title>
<source><![CDATA[CLEF]]></source>
<year>2021</year>
<page-range>1892-8</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Davidson]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Warmsley]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Macy]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Weber]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Automated hate speech detection and the problem of offensive language]]></article-title>
<source><![CDATA[Proceedings of the international AAAI conference on web and social media]]></source>
<year>2017</year>
<volume>11</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>512-5</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Delisle]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Kalaitzis]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Majewski]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[de Berker]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Marin]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Cornebise]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[A large-scale crowdsourced analysis of abuse against women journalists and politicians on Twitter]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Erdem]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Fighting Infodemic Becomes Must After Covid-19 Pandemic's Onslaught on Truth, Knowledge]]></article-title>
<source><![CDATA[European Journal of Natural Sciences and Medicine]]></source>
<year>2021</year>
<volume>5</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>111-24</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fandos]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Roose]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Facebook identifies an active political influence campaign using fake accounts]]></article-title>
<source><![CDATA[The New York Times]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fortuna]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Nunes]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A survey on automatic detection of hate speech in text]]></article-title>
<source><![CDATA[ACM Computing Surveys (CSUR)]]></source>
<year>2018</year>
<volume>51</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>1-30</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gupta]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Sehra]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Vardhan]]></surname>
<given-names><![CDATA[Y. R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Hindi-english code mixed hate speech detection using character level embeddings]]></article-title>
<source><![CDATA[2021 5th International Conference on Computing Methodologies and Communication (ICCMC)]]></source>
<year>2021</year>
<page-range>1112-8</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Howard]]></surname>
<given-names><![CDATA[J. W.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Free speech and hate speech]]></article-title>
<source><![CDATA[Annual Review of Political Science]]></source>
<year>2019</year>
<volume>22</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>93-109</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hussain]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Khatoon]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Aslam]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Khosa]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A Comparative Performance Analysis of Machine Learning Models for Intrusion Detection Classification]]></article-title>
<source><![CDATA[Journal of Cybersecurity]]></source>
<year>2024</year>
<volume>6</volume>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hussain]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Aslam]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Cardiovascular Disease Prediction Using Risk Factors: A Comparative Performance Analysis of Machine Learning Models]]></source>
<year>2024</year>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kamble]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Hate speech detection from code-mixed hindi-english tweets using deep learning models]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Malmasi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Detecting hate speech in social media]]></source>
<year>2017</year>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nasser Alsager]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Towards a Stylometric Authorship Recognition Model for the Social Media Texts in Arabic]]></article-title>
<source><![CDATA[Arab World English Journal]]></source>
<year>2021</year>
<volume>11</volume>
<numero>4</numero>
<issue>4</issue>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Park]]></surname>
<given-names><![CDATA[J. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Fung]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[One-step and two-step classification for abusive language detection on twitter]]></source>
<year>2017</year>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pariyani]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Shah]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Shah]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Vyas]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Degadwala]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Hate speech detection in twitter using natural language processing]]></article-title>
<source><![CDATA[2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV)]]></source>
<year>2021</year>
<page-range>1146-52</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rong]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
</person-group>
<source><![CDATA[word2vec parameter learning explained]]></source>
<year>2014</year>
</nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salawu]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[He]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Lumsden]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Approaches to automated detection of cyberbullying: A survey]]></article-title>
<source><![CDATA[IEEE Transactions on Affective Computing]]></source>
<year>2017</year>
<volume>11</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>3-24</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schmidt]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Wiegand]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A survey on hate speech detection using natural language processing]]></article-title>
<source><![CDATA[Proceedings of the fifth international workshop on natural language processing for social media]]></source>
<year>2017</year>
<page-range>1-10</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wei]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Gupta]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Umair]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Vovor]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Durzynski]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Offensive language and hate speech detection with deep learning and transfer learning]]></source>
<year>2021</year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
