Detection of Hate Speech Texts Using Machine Learning Algorithm

Cited by: 6

Authors:

Sanoussi, Mahamat Saleh Adoum [1]

Chen Xiaohua [1]

Agordzo, George K. [2]

Guindo, Mahamed Lamine [3]

Al Omari, Abdullah Mma [1]

Issa, Boukhari Mahamat [4]

Affiliations:

[1] Huzhou Univ, Sch Informat Engn, Huzhou, Zhejiang, Peoples R China

[2] Anhui Univ Sci & Technol, Sch Math & Big Data, Hefei, Anhui, Peoples R China

[3] Zhejiang Univ, Coll Biosyst Engn, Hangzhou, Peoples R China

[4] Abeche Inst Sci & Technol, Dept Elect Engn, Abeche, Chad

Source:

2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC) | 2022

Keywords:

hate speech; natural language processing; social media; text classification; word embedding;

DOI:

10.1109/CCWC54503.2022.9720792

Chinese Library Classification number:

TP31 [Computer Software]

Discipline classification codes:

081202; 0835

Abstract:

Identifying hate speech on social media has become increasingly crucial for society. Cyberbullying has been shown to significantly affect the social tranquillity of the Chadian population, mainly in places of conflict. This article aims to detect hate speech in texts written in a "lingua franca", a mix of local Chadian languages and French. The dataset used for this study consists of 14,000 comments extracted from the most visited Facebook pages and annotated in four categories (hate, offence, insult, and neutral). The data were cleaned with Natural Language Processing (NLP) techniques and represented with three word embedding methods: Word2Vec, Doc2Vec, and FastText. Finally, four Machine Learning methods, namely Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and K-Nearest Neighbours (KNN), were used to classify the comments into the four categories. The results showed that FastText feature representations as input to the SVM classifier performed best, with 95.4% accuracy for predicting comments containing insult statements, followed by 93.9% for hate statements. The results demonstrate that the model could be used to detect hate speech made by Chadians in social media texts.
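
The clean, embed, and classify pipeline outlined in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. Hashed character n-grams serve here as a crude, untrained stand-in for FastText subword embeddings, and a cosine-similarity k-nearest-neighbours vote stands in for the classifiers. The toy comments, their labels, and all function names are invented for illustration.

```python
import math
import re
import zlib
from collections import Counter

DIM = 256  # length of the hashed feature vector (arbitrary choice for this sketch)


def clean(text):
    """Lowercase, drop URLs, and keep only letters and single spaces."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    text = re.sub(r"[^\w\s]|\d|_", " ", text)
    return " ".join(text.split())


def embed(text, n=3):
    """Hash the character n-grams of each word into an L2-normalised vector."""
    vec = [0.0] * DIM
    for word in clean(text).split():
        padded = f"<{word}>"  # boundary markers, as in subword models
        for i in range(max(1, len(padded) - n + 1)):
            gram = padded[i:i + n]
            vec[zlib.crc32(gram.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def knn_predict(train, text, k=3):
    """Return the majority label among the k most similar training comments."""
    query = embed(text)
    scored = sorted(
        ((sum(a * b for a, b in zip(query, vec)), label) for vec, label in train),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two of the four categories, with mild placeholder text.
TOY_COMMENTS = [
    ("you are a stupid idiot", "insult"),
    ("what an idiot, so stupid", "insult"),
    ("stupid idiot go away", "insult"),
    ("thank you for the good news", "neutral"),
    ("good news today, thank you", "neutral"),
    ("the news today is good", "neutral"),
]
train = [(embed(text), label) for text, label in TOY_COMMENTS]

print(knn_predict(train, "such a stupid idiot!!!"))
```

In a realistic setting, the hashed vectors would be replaced by embeddings trained on the annotated corpus, and the classifier would be evaluated on held-out data rather than on a handful of examples.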

Pages: 266-273

Page count: 8
