Improving hate speech detection using Cross-Lingual Learning

被引:8
|
作者
Firmino, Anderson Almeida [1 ]
Baptista, Claudio de Souza [1 ]
de Paiva, Anselmo Cardoso [2 ]
机构
[1] Univ Fed Campina Grande, Rua Aprigio Veloso 882, Campina Grande, PB, Brazil
[2] Univ Fed Maranhao, Ave Portugueses 1966, Sao Luis, MA, Brazil
关键词
Hate speech detection; Natural language processing; Social media; Cross-Lingual Learning; Deep learning;
D O I
10.1016/j.eswa.2023.121115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growth of social media worldwide has brought social benefits and challenges. One problem we highlight is the proliferation of hate speech on social media. We propose a novel method for detecting hate speech in texts using Cross-Lingual Learning. Our approach uses transfer learning from Pre-Trained Language Models (PTLM) with large corpora available to solve problems in languages with fewer resources for the specific task. The proposed methodology comprises four stages: corpora acquisition, the PTLM definition, training strategies, and evaluation. We carried out experiments using Pre-Trained Language Models in English, Italian, and Portuguese (BERT and XLM-R) to verify which best suited the proposed method. We used corpora in English (WH) and Italian (Evalita 2018) as the source language and the OffComBr-2 corpus in Portuguese (the target language). The results of the experiments showed that the proposed methodology is promising: for the OffComBr-2 corpus, the best state-of-the-art result was obtained (F1-measure = 92%).
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Enhancing cross-lingual hate speech detection through contrastive and adversarial learning
    Almahdi, Asseel Jabbar
    Mohades, Ali
    Akbari, Mohammad
    Heidary, Soroush
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147
  • [2] Metalinguist: enhancing hate speech detection with cross-lingual meta-learning
    Hashmi, Ehtesham
    Yayilgan, Sule Yildirim
    Abomhara, Mohamed
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)
  • [3] Cross-Lingual Few-Shot Hate Speech and Offensive Language Detection Using Meta Learning
    Mozafari, Marzieh
    Farahbakhsh, Reza
    Crespi, Noel
    IEEE ACCESS, 2022, 10 : 14880 - 14896
  • [4] Cross-lingual hate speech detection using domain-specific word embeddings
    Monnar, Ayme Arango
    Rojas, Jorge Perez
    Labra, Barbara Polete
    PLOS ONE, 2024, 19 (07):
  • [5] Cross-lingual Capsule Network for Hate Speech Detection in Social Media
    Jiang, Aiqi
    Zubiaga, Arkaitz
    PROCEEDINGS OF THE 32ND ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT '21), 2021, : 217 - 223
  • [6] Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection
    Nozza, Debora
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 907 - 914
  • [7] A cross-lingual transfer learning method for online COVID-19-related hate speech detection
    Liu, Lin
    Xu, Duo
    Zhao, Pengfei
    Zeng, Daniel Dajun
    Hu, Paul Jen-Hwa
    Zhang, Qingpeng
    Luo, Yin
    Cao, Zhidong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 234
  • [8] A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [9] Label modification and bootstrapping for zero-shot cross-lingual hate speech detection
    Bigoulaeva, Irina
    Hangya, Viktor
    Gurevych, Iryna
    Fraser, Alexander
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (04) : 1515 - 1546
  • [10] Label modification and bootstrapping for zero-shot cross-lingual hate speech detection
    Irina Bigoulaeva
    Viktor Hangya
    Iryna Gurevych
    Alexander Fraser
    Language Resources and Evaluation, 2023, 57 : 1515 - 1546