Graph Embeddings for Abusive Language Detection

被引：0

作者：

Cécillon N. ^{[1
]}

Labatut V. ^{[1
]}

Dufour R. ^{[1
]}

Linarès G. ^{[1
]}

机构：

[1] Laboratoire Informatique d’Avignon-LIA EA 4128, Avignon Université, Avignon

来源：

SN Computer Science | 2021年 / 2卷 / 1期

关键词：

Automatic abuse detection; Conversational graph; Graph embedding; Online conversations; Social networks;

D O I：

10.1007/s42979-020-00413-7

中图分类号：

学科分类号：

摘要：

Abusive behaviors are common on online social networks. The increasing frequency of anti-social behaviors forces the hosts of online platforms to find new solutions to address this problem. Automating the moderation process has thus received a lot of interest in the past few years. Various methods have been proposed, most based on the exchanged content, and one relying on the structure and dynamics of the conversation. It has the advantage of being language-independent, however it leverages a hand-crafted set of topological measures which are computationally expensive and not necessarily suitable to all situations. In the present paper, we propose to use recent graph embedding approaches to automatically learn representations of conversational graphs depicting message exchanges. We compare two categories: node vs. whole-graph embeddings. We experiment with a total of 8 approaches and apply them to a dataset of online messages. We also study more precisely which aspects of the graph structure are leveraged by each approach. Our study shows that the representation produced by certain embeddings captures the information conveyed by specific topological measures, but misses out other aspects. © 2021, Springer Nature Singapore Pte Ltd.

引用

共 50 条

[21] Sarcasm and Implicitness in Abusive Language Detection: A Multilingual Perspective
Frenda, Simona
PROCESAMIENTO DEL LENGUAJE NATURAL, 2023, (70): : 239 - 242
[22] Neural Word Decomposition Models for Abusive Language Detection
Bodapati, Sravan Babu
Gella, Spandana
Al-Onaizan, Yaser
THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 135 - 145
[23] Graph Vertex Embeddings: Distance, Regularization and Community Detection
Nowak, Radoslaw
Malkowski, Adam
Cieslak, Daniel
Sokol, Piotr
Wawrzynski, Pawel
COMPUTATIONAL SCIENCE, ICCS 2024, PT VI, 2024, 14937 : 43 - 57
[24] TopoDetect: Framework for topological features detection in graph embeddings
Haddad, Maroun
Bouguessa, Mohamed
SOFTWARE IMPACTS, 2021, 10
[25] Investigating the role of swear words in abusive language detection tasks
Endang Wahyu Pamungkas
Valerio Basile
Viviana Patti
Language Resources and Evaluation, 2023, 57 : 155 - 188
[26] Investigating the role of swear words in abusive language detection tasks
Pamungkas, Endang Wahyu
Basile, Valerio
Patti, Viviana
LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (01) : 155 - 188
[27] Generalizability of Abusive Language Detection Models on Homogeneous German Datasets
Seemann, Nina
Lee, Yeong Su
Höllig, Julian
Geierhos, Michaela
Datenbank-Spektrum, 2023, 23 (01) : 15 - 25
[28] Racial Bias in Hate Speech and Abusive Language Detection Datasets
Davidson, Thomas
Bhattacharya, Debasmita
Weber, Ingmar
THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 25 - 35
[29] A malware detection model based on imbalanced heterogeneous graph embeddings
Li, Tun
Luo, Ya
Wan, Xin
Li, Qian
Liu, Qilie
Wang, Rong
Jia, Chaolong
Xiao, Yunpeng
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
[30] Word Embeddings for Romanian Language and Their Use for Synonyms Detection
Popescu, Claudiu Marius
Rusu, Corneliu
Grama, Lacrimioara
2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 151 - 155

← 1 2 3 4 5 →