Hate Speech Detection Using Large Language Models: A Comprehensive Review

被引:0
|
作者
Albladi, Aish [1 ]
Islam, Minarul [1 ]
Das, Amit [2 ]
Bigonah, Maryam [1 ]
Zhang, Zheng [3 ]
Jamshidi, Fatemeh [4 ]
Rahgouy, Mostafa [1 ]
Raychawdhary, Nilanjana [1 ]
Marghitu, Daniela [1 ]
Seals, Cheryl [1 ]
机构
[1] Auburn Univ, Auburn, AL 36830 USA
[2] Univ North Alabama, Florence, AL 35632 USA
[3] Murray State Univ, Murray, KY 42071 USA
[4] Calif State Polytech Univ Pomona, Pomona, CA 91768 USA
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Deep learning; Encoding; hate speech detection; large language models; machine learning; TURING TEST;
D O I
10.1109/ACCESS.2025.3532397
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread use of social media and other online platforms has facilitated unprecedented communication and information exchange. However, it has also led to the spread of hate speech and poses serious challenges to societal harmony as well as individual well-being. Traditional methods for detecting hate speech, such as keyword matching, rule-based systems, and machine learning algorithms, often struggle to capture the subtle and context-dependent nature of hateful content. This paper provides a comprehensive review of the application of large language models (LLMs) like GPT-3, BERT, and their successors in hate speech detection. We analyze the evolution of LLMs in natural language processing and examine their strengths and limitations in identifying hate speech. Additionally, we address the significant challenges and explore how LLMs method can affect the accuracy and fairness of hate speech detection systems. By synthesizing recent research, this review aims to offer a holistic understanding of the current state-of-the-art methods in hate speech detection utilizing LLMs and to suggest directions for future research that could enhance the efficacy and equity of these systems.
引用
收藏
页码:20871 / 20892
页数:22
相关论文
共 50 条
  • [21] Accelerating automatic hate speech detection using parallelized ensemble learning models
    Agarwal, Shivang
    Sonawane, Ankur
    Chowdary, C. Ravindranath
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [22] HATE SPEECH AND THE LANGUAGE OF RACISM IN LATIN AMERICA: A LENS FOR RECONSIDERING GLOBAL HATE SPEECH RESTRICTIONS AND LEGISLATION MODELS
    Hernandez, Tanya Kateri
    UNIVERSITY OF PENNSYLVANIA JOURNAL OF INTERNATIONAL LAW, 2011, 32 (03): : 805 - 841
  • [23] HATECHECK: Functional Tests for Hate Speech Detection Models
    Roettger, Paul
    Vidgen, Bertram
    Dong Nguyen
    Waseem, Zeerak
    Margetts, Helen
    Pierrehumbert, Janet B.
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 41 - 58
  • [24] HateCheckHIn: Evaluating Hindi Hate Speech Detection Models
    Das, Mithun
    Saha, Punyajoy
    Mathew, Binny
    Mukherjee, Animesh
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5378 - 5387
  • [25] Hate Speech and Offensive Language Detection using an Emotion-aware Shared Encoder
    Mnassri, Khouloud
    Rajapaksha, Praboda
    Farahbakhsh, Reza
    Crespi, Noel
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 2852 - 2857
  • [26] Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model
    Saleh, Hind
    Alhothali, Areej
    Moria, Kawthar
    APPLIED ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [27] Hate speech and offensive language detection in Dravidian languages using deep ensemble framework
    Roy, Pradeep Kumar
    Bhawal, Snehaan
    Subalalitha, Chinnaudayar Navaneethakrishnan
    COMPUTER SPEECH AND LANGUAGE, 2022, 75
  • [28] Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context
    Faris, Hossam
    Aljarah, Ibrahim
    Habib, Maria
    Castillo, Pedro A.
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 453 - 460
  • [29] Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models
    Cui, Ziyun
    Lei, Chang
    Wu, Wen
    Duan, Yinan
    Qu, Diyang
    Wu, Ji
    Chen, Runsen
    Zhang, Chao
    INTERSPEECH 2024, 2024, : 2915 - 2919
  • [30] Racial Bias in Hate Speech and Abusive Language Detection Datasets
    Davidson, Thomas
    Bhattacharya, Debasmita
    Weber, Ingmar
    THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 25 - 35