Multilingual Hate Speech Detection Using Semi-supervised Generative Adversarial Network

被引：1

作者：

Mnassri, Khouloud ^{[1
]}

Farahbakhsh, Reza ^{[1
]}

Crespi, Noel ^{[1
]}

机构：

[1] Inst Polytech Paris, Samovar, Telecom SudParis, F-91120 Palaiseau, France

来源：

COMPLEX NETWORKS & THEIR APPLICATIONS XII, VOL 4, COMPLEX NETWORKS 2023 | 2024年 / 1144卷

关键词：

Hate Speech; offensive language; semi-supervised; GAN; mBERT; multilingual; social media;

D O I：

10.1007/978-3-031-53503-1_16

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Online communication has overcome linguistic and cultural barriers, enabling global connection through social media platforms. However, linguistic variety introduced more challenges in tasks such as the detection of hate speech content. Although multiple NLP solutions were proposed using advanced machine learning techniques, data annotation scarcity is still a serious problem urging the need for employing semi-supervised approaches. This paper proposes an innovative solution-a multilingual Semi-Supervised model based on Generative Adversarial Networks (GAN) and mBERT models, namely SS-GAN-mBERT. We managed to detect hate speech in Indo-European languages (in English, German, and Hindi) using only 20% labeled data from the HASOC2019 dataset. Our approach excelled in multilingual, zero-shot cross-lingual, and monolingual paradigms, achieving, on average, a 9.23% F1 score boost and 5.75% accuracy increase over baseline mBERT model.

引用

页码：192 / 204

页数：13

共 50 条

[31] Semi-MoreGAN: Semi-supervised Generative Adversarial Network for Mixture of Rain Removal
Shen, Yiyang
Wang, Yongzhen
Wei, Mingqiang
Chen, Honghua
Xie, Haoran
Cheng, Gary
Wang, Fu Lee
COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 443 - 454
[32] A novel semi-supervised method for classification of power quality disturbance using generative adversarial network
Jian, Xianzhong
Wang, Xutao
Jian, Xianzhong (jianxz@usst.edu.cn), 2021, IOS Press BV (40): : 3875 - 3885
[33] A novel semi-supervised method for classification of power quality disturbance using generative adversarial network
Jian, Xianzhong
Wang, Xutao
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 3875 - 3885
[34] Pulsar candidate identification using semi-supervised generative adversarial networks
Balakrishnan, Vishnu
Champion, David
Barr, Ewan
Kramer, Michael
Sengar, Rahul
Bailes, Matthew
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 505 (01) : 1180 - 1194
[35] Localizing Microseismic Events Using Semi-Supervised Generative Adversarial Networks
Feng, Qiang
Han, Liguo
Zhao, Binghui
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[36] Semi-supervised image attribute editing using generative adversarial networks
Dogan, Yahya
Keles, Hacer Yalim
NEUROCOMPUTING, 2020, 401 (401) : 338 - 352
[37] Semi-supervised generative adversarial network framework for modulation recognition of communication signals
Huaji Z.
Jie X.
Shilian Z.
Weiguo S.
Wei W.
Caiyi L.
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2023, 45 (06): : 78 - 83
[38] A Semi-supervised Generative Adversarial Network Algorithm for Alzheimer's Disease Analysis
Yan, Jian
Gui, Renzhou
Liang, Hao
INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (03):
[39] Attention-Based Generative Adversarial Network for Semi-supervised Image Classification
Xuezhi Xiang
Zeting Yu
Ning Lv
Xiangdong Kong
Abdulmotaleb El Saddik
Neural Processing Letters, 2020, 51 : 1527 - 1540
[40] Attention-Based Generative Adversarial Network for Semi-supervised Image Classification
Xiang, Xuezhi
Yu, Zeting
Lv, Ning
Kong, Xiangdong
El Saddik, Abdulmotaleb
NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1527 - 1540

← 1 2 3 4 5 →