Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning

Cited by: 3
Authors
Aljuhani, Khulood O. [1]
Alyoubi, Khaled H. [1]
Alotaibi, Fahd S. [1]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Informat Syst Dept, Jeddah, Saudi Arabia
Source
TEHNICKI GLASNIK-TECHNICAL JOURNAL | 2022 / Vol. 16 / Issue 03
Keywords
Arabic Natural Language Processing; Arabic Tweets; Offensive Language Detection; Offensive Language; Word Embeddings;
DOI
10.31803/tg-20220305120018
Chinese Library Classification number
T [Industrial Technology];
Discipline classification code
08;
Abstract
In recent years, social media networks have emerged as key platforms for expressing opinions, communicating, and distributing content. However, users often take advantage of perceived anonymity on these platforms to share offensive or hateful content. As online communication and social media have grown in popularity, offensive language has become a significant problem, and methods for detecting offensive content and preventing its spread on online social networks have attracted considerable attention. This paper aims to develop an effective Arabic offensive language detection model by employing deep learning with semantic and contextual features. It proposes a deep learning approach that uses a bidirectional long short-term memory (BiLSTM) model and domain-specific word embeddings extracted from an Arabic offensive dataset. The detection approach was evaluated on an Arabic dataset collected from Twitter. The highest accuracy, 0.93, was achieved by the BiLSTM model trained using a combination of domain-specific and domain-agnostic word embeddings.
Pages: 394 - 400
Page count: 7
Related papers
50 items in total
  • [31] ES2Vec: Earth Science Metadata Keyword Assignment using Domain-Specific Word Embeddings
    Ramasubramanian, Muthukumaran
    Muhammad, Hassan
    Gurung, Iksha
    Maskey, Manil
    Ramachandran, Rahul
    IEEE SOUTHEASTCON 2020, 2020,
  • [32] Genre Classification using Word Embeddings and Deep Learning
    Kumar, Akshi
    Rajpal, Arjun
    Rathore, Dushyant
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2142 - 2146
  • [33] Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context
    Faris, Hossam
    Aljarah, Ibrahim
    Habib, Maria
    Castillo, Pedro A.
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 453 - 460
  • [34] Deep Learning for Domain-Specific Action Recognition in Tennis
    Mora, Silvia Vinyes
    Knottenbelt, William J.
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 170 - 178
  • [35] Detecting Hateful and Offensive Speech in Arabic Social Media Using Transfer Learning
    Boulouard, Zakaria
    Ouaissa, Mariya
    Ouaissa, Mariyam
    Krichen, Moez
    Almutiq, Mutiq
    Gasmi, Karim
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [36] Language Models Learning for Domain-Specific Natural Language User Interaction
    Bai, Shuanhu
    Huang, Chien-Lin
    Tan, Yeow-Kee
    Ma, Bin
    2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4, 2009, : 2480 - 2485
  • [37] Learning Domain-Specific, L1-Specific Measures of Word Readability
    Bergsma, Shane
    Yarowsky, David
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2013, 54 (01): 203 - 226
  • [38] Augmenting Large Language Models via Vector Embeddings to Improve Domain-specific Responsiveness
    Wolfrath, Nathan M.
    Verhagen, Nathaniel B.
    Crotty, Bradley H.
    Somai, Melek
    Kothari, Anai N.
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2024, (214):
  • [39] Development and evaluation of novel ophthalmology domain-specific neural word embeddings to predict visual prognosis
    Wang, Sophia
    Tseng, Benjamin
    Hernandez-Boussard, Tina
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 150
  • [40] Using a Domain-Specific Language to Enrich ETL Schemas
    Belo, Orlando
    Gomes, Claudia
    Oliveira, Bruno
    Marques, Ricardo
    Santos, Vasco
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2015), 2015, 539 : 28 - 35