Graph Embeddings for Abusive Language Detection

被引:0
|
作者
Cécillon N. [1 ]
Labatut V. [1 ]
Dufour R. [1 ]
Linarès G. [1 ]
机构
[1] Laboratoire Informatique d’Avignon-LIA EA 4128, Avignon Université, Avignon
关键词
Automatic abuse detection; Conversational graph; Graph embedding; Online conversations; Social networks;
D O I
10.1007/s42979-020-00413-7
中图分类号
学科分类号
摘要
Abusive behaviors are common on online social networks. The increasing frequency of anti-social behaviors forces the hosts of online platforms to find new solutions to address this problem. Automating the moderation process has thus received a lot of interest in the past few years. Various methods have been proposed, most based on the exchanged content, and one relying on the structure and dynamics of the conversation. It has the advantage of being language-independent, however it leverages a hand-crafted set of topological measures which are computationally expensive and not necessarily suitable to all situations. In the present paper, we propose to use recent graph embedding approaches to automatically learn representations of conversational graphs depicting message exchanges. We compare two categories: node vs. whole-graph embeddings. We experiment with a total of 8 approaches and apply them to a dataset of online messages. We also study more precisely which aspects of the graph structure are leveraged by each approach. Our study shows that the representation produced by certain embeddings captures the information conveyed by specific topological measures, but misses out other aspects. © 2021, Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [21] Sarcasm and Implicitness in Abusive Language Detection: A Multilingual Perspective
    Frenda, Simona
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2023, (70): : 239 - 242
  • [22] Neural Word Decomposition Models for Abusive Language Detection
    Bodapati, Sravan Babu
    Gella, Spandana
    Al-Onaizan, Yaser
    THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 135 - 145
  • [23] Graph Vertex Embeddings: Distance, Regularization and Community Detection
    Nowak, Radoslaw
    Malkowski, Adam
    Cieslak, Daniel
    Sokol, Piotr
    Wawrzynski, Pawel
    COMPUTATIONAL SCIENCE, ICCS 2024, PT VI, 2024, 14937 : 43 - 57
  • [24] TopoDetect: Framework for topological features detection in graph embeddings
    Haddad, Maroun
    Bouguessa, Mohamed
    SOFTWARE IMPACTS, 2021, 10
  • [25] Investigating the role of swear words in abusive language detection tasks
    Endang Wahyu Pamungkas
    Valerio Basile
    Viviana Patti
    Language Resources and Evaluation, 2023, 57 : 155 - 188
  • [26] Investigating the role of swear words in abusive language detection tasks
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (01) : 155 - 188
  • [27] Generalizability of Abusive Language Detection Models on Homogeneous German Datasets
    Seemann, Nina
    Lee, Yeong Su
    Höllig, Julian
    Geierhos, Michaela
    Datenbank-Spektrum, 2023, 23 (01) : 15 - 25
  • [28] Racial Bias in Hate Speech and Abusive Language Detection Datasets
    Davidson, Thomas
    Bhattacharya, Debasmita
    Weber, Ingmar
    THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 25 - 35
  • [29] A malware detection model based on imbalanced heterogeneous graph embeddings
    Li, Tun
    Luo, Ya
    Wan, Xin
    Li, Qian
    Liu, Qilie
    Wang, Rong
    Jia, Chaolong
    Xiao, Yunpeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [30] Word Embeddings for Romanian Language and Their Use for Synonyms Detection
    Popescu, Claudiu Marius
    Rusu, Corneliu
    Grama, Lacrimioara
    2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 151 - 155