Exploiting Emojis for Abusive Language Detection

被引:0
|
作者
Wiegand, Michael [1 ]
Ruppenhofer, Josef [2 ]
机构
[1] Alpen Adria Univ Klagenfurt, Digital Age Res Ctr D ARC, AT-9020 Klagenfurt, Austria
[2] Leibniz Inst German Language, D-68161 Mannheim, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose to use abusive emojis, such as the middle finger or face vomiting, as a proxy for learning a lexicon of abusive words. Since it represents extralinguistic information, a single emoji can co-occur with different forms of explicitly abusive utterances. We show that our approach generates a lexicon that offers the same performance in cross-domain classification of abusive microposts as the most advanced lexicon induction method. Such an approach, in contrast, is dependent on manually annotated seed words and expensive lexical resources for bootstrapping (e.g. WordNet). We demonstrate that the same emojis can also be effectively used in languages other than English. Finally, we also show that emojis can be exploited for classifying mentions of ambiguous words, such as fuck and bitch, into generally abusive and just profane usages.
引用
收藏
页码:369 / 380
页数:12
相关论文
共 50 条
  • [1] Exploiting Emojis for Sarcasm Detection
    Subramanian, Jayashree
    Sridharan, Varun
    Shu, Kai
    Liu, Huan
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, SBP-BRIMS 2019, 2019, 11549 : 70 - 80
  • [2] BERT-based Approach to Arabic Hate Speech and Offensive Language Detection in Twitter: Exploiting Emojis and Sentiment Analysis
    Althobaiti, Maha Jarallah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 972 - 980
  • [3] Graph Embeddings for Abusive Language Detection
    Cécillon N.
    Labatut V.
    Dufour R.
    Linarès G.
    SN Computer Science, 2021, 2 (1)
  • [4] Exploiting Emojis in Sentiment Analysis: A Survey
    Grover V.
    Journal of The Institution of Engineers (India): Series B, 2022, 103 (1) : 259 - 272
  • [5] Exploiting Emojis in Sentiment Analysis: A Survey
    Grover, Vandita
    Journal of The Institution of Engineers (India): Series B, 2022, 103 (01) : 259 - 272
  • [6] Detection of Abusive Language: the Problem of Biased Datasets
    Wiegand, Michael
    Ruppenhofertt, Josef
    Kleinbauer, Thomas
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 602 - 608
  • [7] Measuring and mitigating language model biases in abusive language detection
    Song, Rui
    Giunchiglia, Fausto
    Li, Yingji
    Shi, Lida
    Xu, Hao
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [8] Abusive Language Detection with Graph Convolutional Networks
    Mishra, Pushkar
    Del Tredici, Marco
    Yannakoudakis, Helen
    Shutova, Ekaterina
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2145 - 2150
  • [9] Joint Modelling of Emotion and Abusive Language Detection
    Rajamanickam, Santhosh
    Mishra, Pushkar
    Yannakoudakis, Helen
    Shutova, Ekaterina
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4270 - 4279
  • [10] Automatic Abusive Language Detection in Urdu Tweets
    Amjad, Maaz
    Ashraf, Noman
    Sidorov, Grigori
    Zhila, Alisa
    Chanona-Hernandez, Liliana
    Gelbukh, Alexander
    ACTA POLYTECHNICA HUNGARICA, 2022, 19 (10) : 143 - 163