Detection of Hate Speech Texts Using Machine Learning Algorithm

被引:6
|
作者
Sanoussi, Mahamat Saleh Adoum [1 ]
Chen Xiaohua [1 ]
Agordzo, George K. [2 ]
Guindo, Mahamed Lamine [3 ]
Al Omari, Abdullah Mma [1 ]
Issa, Boukhari Mahamat [4 ]
机构
[1] Huzhou Univ, Sch Informat Engn, Huzhou, Zhejiang, Peoples R China
[2] Anhui Univ Sci & Technol, Sch Math & Big Data, Hefei, Anhui, Peoples R China
[3] Zhejiang Univ, Coll Biosyst Engn, Hangzhou, Peoples R China
[4] Abeche Inst Sci & Technol, Dept Elect Engn, Abeche, Chad
关键词
hate speech; natural language processing; social media; text classification; word embedding;
D O I
10.1109/CCWC54503.2022.9720792
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Identifying hate speech on social media has become increasingly crucial for society. It has been shown that cyberbullying significantly affects the social tranquillity of the Chadian population, mainly in places of conflict. This article aims to detect hate speech for texts written in "lingua franca", a mix of the local Chadian and French languages. The dataset consists of 14,000 comments extracted from the most visited Facebook pages and annotated in four categories (hate, offence, insult and neutral) were used for this study. The data were cleaned by Natural Language Processing techniques (NLP) and applied to three word embedding methods such as Word2Vec, Doc2Vec, and Fasttext. Finally, four Machine Learning methods, namely Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and K-Nearest Neighbours (KNN), were computed to classify the different categories. The result showed that FastText features representation as input to SVM classifier was the best with 95.4% accuracy for predicting the comment contained insult statement followed by hate statement 93.9%. The result demonstrated our model could be used to detect the hate speech made by Chadians on social media texts.
引用
收藏
页码:266 / 273
页数:8
相关论文
共 50 条
  • [1] Twitter Hate Speech Detection using Machine Learning
    Janardhan, G.
    Saikiran, Bollu
    Reddy, Inugala Swanith
    Abhishek, Mogilicherla
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 270 - 278
  • [2] Hate Speech Detection Using Text Mining and Machine Learning
    Alaoui, Safae Sossi
    Farhaoui, Yousef
    Aksasse, Brahim
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [3] Bangla Hate Speech Detection in Videos Using Machine Learning
    Junaid, Mohd Istiaq Hossain
    Hossain, Faisal
    Rahman, Rashedur M.
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 347 - 351
  • [4] Multi-modal Hate Speech Detection using Machine Learning
    Boishakhi, Fariha Tahosin
    Shill, Ponkoj Chandra
    Alam, Md Golam Rabiul
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4496 - 4499
  • [5] Automatic Hate Speech Detection using Machine Learning: A Comparative Study
    Abro, Sindhu
    Shaikh, Sarang
    Ali, Zafar
    Khan, Sajid
    Mujtaba, Ghulam
    Khand, Zahid Hussain
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (08) : 484 - 491
  • [6] Automatic hate speech detection in audio using machine learning algorithms
    Imbwaga J.L.
    Chittaragi N.B.
    Koolagudi S.G.
    International Journal of Speech Technology, 2024, 27 (02) : 447 - 469
  • [7] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
    Fernando, W. S. S.
    Weerasinghe, Ruvan
    Bandara, E. R. A. D.
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [8] Hate Speech Detection in Social Networks using Machine Learning and Deep Learning Methods
    Toktarova, Aigerim
    Syrlybay, Dariga
    Myrzakhmetova, Bayan
    Anuarbekova, Gulzat
    Rakhimbayeva, Gulbarshin
    Zhylanbaeva, Balkiya
    Suieuova, Nabat
    Kerimbekov, Mukhtar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 396 - 406
  • [9] Correction: Automatic hate speech detection in audio using machine learning algorithms
    Joan L. Imbwaga
    Nagaratna B. Chittaragi
    Shashidhar G. Koolagudi
    International Journal of Speech Technology, 2025, 28 (1) : 313 - 313
  • [10] Hate Speech is not Free Speech: Explainable Machine Learning for Hate Speech Detection in Code-Mixed Languages
    Yadav, Sargam
    Kaushik, Abhishek
    McDaid, Kevin
    2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023,