Multilingual Hate Speech Detection Using Semi-supervised Generative Adversarial Network

被引:1
|
作者
Mnassri, Khouloud [1 ]
Farahbakhsh, Reza [1 ]
Crespi, Noel [1 ]
机构
[1] Inst Polytech Paris, Samovar, Telecom SudParis, F-91120 Palaiseau, France
关键词
Hate Speech; offensive language; semi-supervised; GAN; mBERT; multilingual; social media;
D O I
10.1007/978-3-031-53503-1_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online communication has overcome linguistic and cultural barriers, enabling global connection through social media platforms. However, linguistic variety introduced more challenges in tasks such as the detection of hate speech content. Although multiple NLP solutions were proposed using advanced machine learning techniques, data annotation scarcity is still a serious problem urging the need for employing semi-supervised approaches. This paper proposes an innovative solution-a multilingual Semi-Supervised model based on Generative Adversarial Networks (GAN) and mBERT models, namely SS-GAN-mBERT. We managed to detect hate speech in Indo-European languages (in English, German, and Hindi) using only 20% labeled data from the HASOC2019 dataset. Our approach excelled in multilingual, zero-shot cross-lingual, and monolingual paradigms, achieving, on average, a 9.23% F1 score boost and 5.75% accuracy increase over baseline mBERT model.
引用
收藏
页码:192 / 204
页数:13
相关论文
共 50 条
  • [41] CCS-GAN: a semi-supervised generative adversarial network for image classification
    Lei Wang
    Yu Sun
    Zheng Wang
    The Visual Computer, 2022, 38 : 2009 - 2021
  • [42] Semi-supervised Image Classification via Attention Mechanism and Generative Adversarial Network
    Xiang, Xuezhi
    Yu, Zeting
    Lv, Ning
    Kong, Xiangdong
    Saddik, Abdulmotaleb Ei
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [43] Semi-supervised blockwisely architecture search for efficient lightweight generative adversarial network
    Zhang, Man
    Zhou, Yong
    Zhao, Jiaqi
    Xia, Shixiong
    Wang, Jiaqi
    Huang, Zizheng
    PATTERN RECOGNITION, 2021, 112
  • [44] CCS-GAN: a semi-supervised generative adversarial network for image classification
    Wang, Lei
    Sun, Yu
    Wang, Zheng
    VISUAL COMPUTER, 2022, 38 (06): : 2009 - 2021
  • [45] Semi-supervised generative adversarial network with guaranteed safeness for industrial quality prediction
    Zhang, Xu
    Zou, Yuanyuan
    Li, Shaoyuan
    COMPUTERS & CHEMICAL ENGINEERING, 2021, 153
  • [46] Semi-supervised Seizure Prediction with Generative Adversarial Networks
    Nhan Duy Truong
    Zhou, Luping
    Kavehei, Omid
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2369 - 2372
  • [47] Semi-Supervised Learning with Coevolutionary Generative Adversarial Networks
    Toutouh, Jamal
    Nalluru, Subhash
    Hemberg, Erik
    O'Reilly, Una-May
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 568 - 576
  • [48] Semi-Supervised Dose Prediction with Generative Adversarial Learning
    Lam, D.
    Sun, B.
    MEDICAL PHYSICS, 2019, 46 (06) : E418 - E418
  • [49] Semi-supervised Learning on Graphs with Generative Adversarial Nets
    Ding, Ming
    Tang, Jie
    Zhang, Jie
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 913 - 922
  • [50] Semi-supervised Generative Adversarial Hashing for Image Retrieval
    Wang, Guan'an
    Hu, Qinghao
    Cheng, Jian
    Hou, Zengguang
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 491 - 507