Multilingual Hate Speech Detection Using Semi-supervised Generative Adversarial Network

被引:1
|
作者
Mnassri, Khouloud [1 ]
Farahbakhsh, Reza [1 ]
Crespi, Noel [1 ]
机构
[1] Inst Polytech Paris, Samovar, Telecom SudParis, F-91120 Palaiseau, France
关键词
Hate Speech; offensive language; semi-supervised; GAN; mBERT; multilingual; social media;
D O I
10.1007/978-3-031-53503-1_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online communication has overcome linguistic and cultural barriers, enabling global connection through social media platforms. However, linguistic variety introduced more challenges in tasks such as the detection of hate speech content. Although multiple NLP solutions were proposed using advanced machine learning techniques, data annotation scarcity is still a serious problem urging the need for employing semi-supervised approaches. This paper proposes an innovative solution-a multilingual Semi-Supervised model based on Generative Adversarial Networks (GAN) and mBERT models, namely SS-GAN-mBERT. We managed to detect hate speech in Indo-European languages (in English, German, and Hindi) using only 20% labeled data from the HASOC2019 dataset. Our approach excelled in multilingual, zero-shot cross-lingual, and monolingual paradigms, achieving, on average, a 9.23% F1 score boost and 5.75% accuracy increase over baseline mBERT model.
引用
收藏
页码:192 / 204
页数:13
相关论文
共 50 条
  • [31] Semi-MoreGAN: Semi-supervised Generative Adversarial Network for Mixture of Rain Removal
    Shen, Yiyang
    Wang, Yongzhen
    Wei, Mingqiang
    Chen, Honghua
    Xie, Haoran
    Cheng, Gary
    Wang, Fu Lee
    COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 443 - 454
  • [32] A novel semi-supervised method for classification of power quality disturbance using generative adversarial network
    Jian, Xianzhong
    Wang, Xutao
    Jian, Xianzhong (jianxz@usst.edu.cn), 2021, IOS Press BV (40): : 3875 - 3885
  • [33] A novel semi-supervised method for classification of power quality disturbance using generative adversarial network
    Jian, Xianzhong
    Wang, Xutao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 3875 - 3885
  • [34] Pulsar candidate identification using semi-supervised generative adversarial networks
    Balakrishnan, Vishnu
    Champion, David
    Barr, Ewan
    Kramer, Michael
    Sengar, Rahul
    Bailes, Matthew
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 505 (01) : 1180 - 1194
  • [35] Localizing Microseismic Events Using Semi-Supervised Generative Adversarial Networks
    Feng, Qiang
    Han, Liguo
    Zhao, Binghui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [36] Semi-supervised image attribute editing using generative adversarial networks
    Dogan, Yahya
    Keles, Hacer Yalim
    NEUROCOMPUTING, 2020, 401 (401) : 338 - 352
  • [37] Semi-supervised generative adversarial network framework for modulation recognition of communication signals
    Huaji Z.
    Jie X.
    Shilian Z.
    Weiguo S.
    Wei W.
    Caiyi L.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2023, 45 (06): : 78 - 83
  • [38] A Semi-supervised Generative Adversarial Network Algorithm for Alzheimer's Disease Analysis
    Yan, Jian
    Gui, Renzhou
    Liang, Hao
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (03):
  • [39] Attention-Based Generative Adversarial Network for Semi-supervised Image Classification
    Xuezhi Xiang
    Zeting Yu
    Ning Lv
    Xiangdong Kong
    Abdulmotaleb El Saddik
    Neural Processing Letters, 2020, 51 : 1527 - 1540
  • [40] Attention-Based Generative Adversarial Network for Semi-supervised Image Classification
    Xiang, Xuezhi
    Yu, Zeting
    Lv, Ning
    Kong, Xiangdong
    El Saddik, Abdulmotaleb
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1527 - 1540