Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning

被引:3
|
作者
Aljuhani, Khulood O. [1 ]
Alyoubi, Khaled H. [1 ]
Alotaibi, Fahd S. [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Informat Syst Dept, Jeddah, Saudi Arabia
来源
TEHNICKI GLASNIK-TECHNICAL JOURNAL | 2022年 / 16卷 / 03期
关键词
Arabic Natural Language Processing; Arabic Tweets; Offensive Language Detection; Offensive Language; Word Embeddings;
D O I
10.31803/tg-20220305120018
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, social media networks are emerging as a key player by providing platforms for opinions expression, communication, and content distribution. However, users often take advantage of perceived anonymity on social media platforms to share offensive or hateful content. Thus, offensive language has grown as a significant issue with the increase in online communication and the popularity of social media platforms. This problem has attracted significant attention for devising methods for detecting offensive content and preventing its spread on online social networks. Therefore, this paper aims to develop an effective Arabic offensive language detection model by employing deep learning and semantic and contextual features. This paper proposes a deep learning approach that utilizes the bidirectional long short-term memory (BiLSTM) model and domain-specific word embeddings extracted from an Arabic offensive dataset. The detection approach was evaluated on an Arabic dataset collected from Twitter. The results showed the highest performance accuracy of 0.93% with the BiLSTM model trained using a combination of domain-specific and agnostic-domain word embeddings.
引用
收藏
页码:394 / 400
页数:7
相关论文
共 50 条
  • [41] An unsupervised incremental learning algorithm for domain-specific language development
    Javed, Faizan
    Mernik, Marjan
    Bryant, Barrett R.
    Sprague, Alan
    APPLIED ARTIFICIAL INTELLIGENCE, 2008, 22 (7-8) : 707 - 729
  • [42] Detection of Arabic offensive language in social media using machine learning models
    Mousa, Aya
    Shahin, Ismail
    Nassif, Ali Bou
    Elnagar, Ashraf
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
  • [43] Fall Detection in EHR using Word Embeddings and Deep Learning
    dos Santos, Henrique D. P.
    Silva, Amanda P.
    Maciel, Maria Carolina O.
    Burin, Haline Maria V.
    Urbanetto, Janete S.
    Vieira, Renata
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 265 - 268
  • [44] Lexical Function Identification Using Word Embeddings and Deep Learning
    Hernandez-Miranda, Arturo
    Gelbukh, Alexander
    Kolesnikova, Olga
    ADVANCES IN SOFT COMPUTING, MICAI 2019, 2019, 11835 : 77 - 86
  • [45] Detecting Environmental, Social and Governance (ESG) Topics Using Domain-Specific Language Models and Data Augmentation
    Nugent, Tim
    Stelea, Nicole
    Leidner, Jochen L.
    FLEXIBLE QUERY ANSWERING SYSTEMS (FQAS 2021), 2021, 12871 : 157 - 169
  • [46] ReSIL: Revivifying Function Signature Inference using Deep Learning with Domain-Specific Knowledge
    Lin, Yan
    Gao, Debin
    Lo, David
    CODASPY'22: PROCEEDINGS OF THE TWELVETH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2022, : 107 - 118
  • [47] Learning and using domain-specific heuristics in ASP solvers
    Balduccini, Marcello
    AI COMMUNICATIONS, 2011, 24 (02) : 147 - 164
  • [48] Debugging measurement systems using a domain-specific modeling language
    Kosar, Tomaz
    Mernik, Marjan
    Gray, Jeff
    Kos, Tomaz
    COMPUTERS IN INDUSTRY, 2014, 65 (04) : 622 - 635
  • [49] A Novel Approach using Alloy in Domain-specific Language Engineering
    Moreira, Rodrigo M. L. M.
    Paiva, Ana C. R.
    MODELSWARD 2015 PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2015, : 157 - 164
  • [50] Early number word learning: Associations with domain-general and domain-specific quantitative abilities
    Yang, Meiling
    Liang, Junying
    FRONTIERS IN PSYCHOLOGY, 2022, 13