Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning

被引:3
|
作者
Aljuhani, Khulood O. [1 ]
Alyoubi, Khaled H. [1 ]
Alotaibi, Fahd S. [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Informat Syst Dept, Jeddah, Saudi Arabia
来源
TEHNICKI GLASNIK-TECHNICAL JOURNAL | 2022年 / 16卷 / 03期
关键词
Arabic Natural Language Processing; Arabic Tweets; Offensive Language Detection; Offensive Language; Word Embeddings;
D O I
10.31803/tg-20220305120018
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In recent years, social media networks are emerging as a key player by providing platforms for opinions expression, communication, and content distribution. However, users often take advantage of perceived anonymity on social media platforms to share offensive or hateful content. Thus, offensive language has grown as a significant issue with the increase in online communication and the popularity of social media platforms. This problem has attracted significant attention for devising methods for detecting offensive content and preventing its spread on online social networks. Therefore, this paper aims to develop an effective Arabic offensive language detection model by employing deep learning and semantic and contextual features. This paper proposes a deep learning approach that utilizes the bidirectional long short-term memory (BiLSTM) model and domain-specific word embeddings extracted from an Arabic offensive dataset. The detection approach was evaluated on an Arabic dataset collected from Twitter. The results showed the highest performance accuracy of 0.93% with the BiLSTM model trained using a combination of domain-specific and agnostic-domain word embeddings.
引用
收藏
页码:394 / 400
页数:7
相关论文
共 50 条
  • [21] Inferring Multilingual Domain-Specific Word Embeddings From Large Document Corpora
    Cagliero, Luca
    La Quatra, Moreno
    IEEE ACCESS, 2021, 9 : 137309 - 137321
  • [22] Folding Domain-Specific Languages: Deep and Shallow Embeddings (Functional Pearl)
    Gibbons, Jeremy
    Wu, Nicolas
    ICFP'14: PROCEEDINGS OF THE 2014 ACM SIGPLAN INTERNATIONAL CONFERENCE ON FUNCTIONAL PROGRAMMING, 2014, : 339 - 347
  • [23] The Evolution of Domain-Specific Computing for Deep Learning
    Neuendorffer S.
    Khodamoradi A.K.
    Denolf K.
    Jain A.K.
    Bayliss S.
    IEEE Circuits and Systems Magazine, 2021, 21 (02) : 75 - 96
  • [24] Detecting White Supremacist Hate Speech Using Domain Specific Word Embedding With Deep Learning and BERT
    Alatawi, Hind S.
    Alhothali, Areej M.
    Moria, Kawthar M.
    IEEE ACCESS, 2021, 9 : 106363 - 106374
  • [25] Domain-specific and domain-general constraints on word and sequence learning
    Archibald, Lisa M. D.
    Joanisse, Marc F.
    MEMORY & COGNITION, 2013, 41 (02) : 268 - 280
  • [26] Domain-specific and domain-general constraints on word and sequence learning
    Lisa M. D. Archibald
    Marc F. Joanisse
    Memory & Cognition, 2013, 41 : 268 - 280
  • [27] Best Practices for Learning Domain-Specific Cross-Lingual Embeddings
    Shakurova, Lena
    Nyari, Beata
    Li, Chao
    Rotaru, Mihai
    4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 230 - 234
  • [28] A domain-specific language for describing machine learning datasets
    Giner-Miguelez, Joan
    Gomez, Abel
    Cabot, Jordi
    JOURNAL OF COMPUTER LANGUAGES, 2023, 76
  • [29] Arbiter: A Domain-Specific Language for Ethical Machine Learning
    Zucker, Julian
    d'Leeuwen, Myraeka
    PROCEEDINGS OF THE 3RD AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY AIES 2020, 2020, : 421 - 425
  • [30] Identification of Domain-Specific Senses Based on Word Embedding Learning
    Wangpoonsarp, Attaporn
    Fukumoto, Fumiyo
    HUMAN LANGUAGE TECHNOLOGY. CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, LTC 2017, 2020, 12598 : 341 - 350