Detection of Hate Speech Texts Using Machine Learning Algorithm

被引:6
|
作者
Sanoussi, Mahamat Saleh Adoum [1 ]
Chen Xiaohua [1 ]
Agordzo, George K. [2 ]
Guindo, Mahamed Lamine [3 ]
Al Omari, Abdullah Mma [1 ]
Issa, Boukhari Mahamat [4 ]
机构
[1] Huzhou Univ, Sch Informat Engn, Huzhou, Zhejiang, Peoples R China
[2] Anhui Univ Sci & Technol, Sch Math & Big Data, Hefei, Anhui, Peoples R China
[3] Zhejiang Univ, Coll Biosyst Engn, Hangzhou, Peoples R China
[4] Abeche Inst Sci & Technol, Dept Elect Engn, Abeche, Chad
关键词
hate speech; natural language processing; social media; text classification; word embedding;
D O I
10.1109/CCWC54503.2022.9720792
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Identifying hate speech on social media has become increasingly crucial for society. It has been shown that cyberbullying significantly affects the social tranquillity of the Chadian population, mainly in places of conflict. This article aims to detect hate speech for texts written in "lingua franca", a mix of the local Chadian and French languages. The dataset consists of 14,000 comments extracted from the most visited Facebook pages and annotated in four categories (hate, offence, insult and neutral) were used for this study. The data were cleaned by Natural Language Processing techniques (NLP) and applied to three word embedding methods such as Word2Vec, Doc2Vec, and Fasttext. Finally, four Machine Learning methods, namely Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and K-Nearest Neighbours (KNN), were computed to classify the different categories. The result showed that FastText features representation as input to SVM classifier was the best with 95.4% accuracy for predicting the comment contained insult statement followed by hate statement 93.9%. The result demonstrated our model could be used to detect the hate speech made by Chadians on social media texts.
引用
收藏
页码:266 / 273
页数:8
相关论文
共 50 条
  • [21] Detection of Hate Tweets using Machine Learning and Deep Learning
    Ketsbaia, Lida
    Issac, Biju
    Chen, Xiaomin
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 751 - 758
  • [22] Improving Sinhala Hate Speech Detection Using Deep Learning
    Gamage, Kavishka
    Welgama, Viraj
    Weerasinghe, Ruvan
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [23] Detection of hate speech in Arabic tweets using deep learning
    Al-Hassan, Areej
    Al-Dossari, Hmood
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1963 - 1974
  • [24] Detection of hate speech in Arabic tweets using deep learning
    Areej Al-Hassan
    Hmood Al-Dossari
    Multimedia Systems, 2022, 28 : 1963 - 1974
  • [25] A comparative analysis of machine learning algorithms for hate speech detection in social media
    Omran, Esraa
    Al Tararwah, Estabraq
    Al Qundus, Jamal
    ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2023, 13 (04):
  • [26] Intelligent detection of hate speech in Arabic social network: A machine learning approach
    Aljarah, Ibrahim
    Habib, Maria
    Hijazi, Neveen
    Faris, Hossam
    Qaddoura, Raneem
    Hammo, Bassam
    Abushariah, Mohammad
    Alfawareh, Mohammad
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 483 - 501
  • [27] Detecting Hate Speech on Twitter Network Using Ensemble Machine Learning
    Mutanga, Raymond T.
    Naicker, Nalindren
    Olugbara, Oludayo O.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 331 - 339
  • [28] Advances in Machine Learning Algorithms for Hate Speech Detection in Social Media: A Review
    Mullah, Nanlir Sallau
    Zainon, Wan Mohd Nazmee Wan
    IEEE ACCESS, 2021, 9 : 88364 - 88376
  • [29] Hate Speech Detection in Indonesian Twitter Texts using Bidirectional Gated Recurrent Unit
    Marpaung, Angela
    Rismala, Rita
    Nurrahmi, Hani
    2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 186 - 190
  • [30] Multilingual Hate Speech Detection: Innovations in Optimized Deep Learning for English and Arabic Hate Speech Detection
    Hassan AL-Sukhani
    Qusay Bsoul
    Abdelrahman H. Elhawary
    Ziad M. Nasr
    Ahmed E. Mansour
    Radwan M. Batyha
    Basma S. Alqadi
    Jehad Saad Alqurni
    Hayat Alfagham
    Magda M. Madbouly
    SN Computer Science, 6 (3)