A Dataset of Offensive German Language Tweets Annotated for Speech Acts

被引:0
|
作者
Plakidis, Melina [1 ,2 ]
Rehm, Georg [1 ,2 ]
机构
[1] DFKI GmbH, Alt Moabit 91C, D-10559 Berlin, Germany
[2] Humboldt Univ, Dorotheenstr 24, D-10117 Berlin, Germany
关键词
Speech acts; hate speech detection; offensive language; annotation; corpus annotation;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present a dataset consisting of German offensive and non-offensive tweets, annotated for speech acts. These 600 tweets are a subset of the dataset by (Stru beta et al., 2019) and comprises three levels of annotation, i. e., six coarse-grained speech acts, 23 fine-grained speech acts and 14 different sentence types. Furthermore, we provide an evaluation in both qualitative and quantitative terms. The dataset is made publicly available under a CC-BY-4.0 license.
引用
收藏
页码:4799 / 4807
页数:9
相关论文
共 50 条
  • [31] COMMUNICATIVE-PRAGMATIC SPACE OF DIRECTIVE SPEECH ACTS WITH MODAL VERBS (IN THE GERMAN LANGUAGE)
    Rogozhnikova, I. N.
    VESTNIK VOLGOGRADSKOGO GOSUDARSTVENNOGO UNIVERSITETA-SERIYA 2-YAZYKOZNANIE, 2012, 11 (02): : 85 - 89
  • [32] GerCCT: An Annotated Corpus for Mining Arguments in German Tweets on Climate Change
    Schaefer, Robin
    Stede, Manfred
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6121 - 6130
  • [33] Pashto offensive language detection: a benchmark dataset and monolingual Pashto BERT
    Haq, Ijazul
    Qiu, Weidong
    Guo, Jie
    Tang, Peng
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [34] OLID-BR: offensive language identification dataset for Brazilian Portuguese
    Trajano, Douglas
    Bordini, Rafael H.
    Vieira, Renata
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (04) : 1263 - 1289
  • [35] Pashto offensive language detection: a benchmark dataset and monolingual Pashto BERT
    Haq I.
    Qiu W.
    Guo J.
    Tang P.
    PeerJ Computer Science, 2023, 9 : 1 - 26
  • [36] Emojis as anchors to detect Arabic offensive language and hate speech
    Mubarak, Hamdy
    Hassan, Sabit
    Chowdhury, Shammur Absar
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1436 - 1457
  • [37] Hate-Speech and Offensive Language Detection in Roman Urdu
    Rizwan, Hammad
    Shakeel, Muhammad Haroon
    Karim, Asim
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2512 - 2522
  • [38] Towards Automatic Detection and Explanation of Hate Speech and Offensive Language
    Dorris, Wyatt
    Hu, Ruijia
    Vishwamitra, Nishant
    Luo, Feng
    Costello, Matthew
    PROCEEDINGS OF THE SIXTH INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS (IWSPA'20), 2020, : 23 - 29
  • [39] Offensive Language and Hate Speech Detection Based on Transfer Learning
    Touahri, Ibtissam
    Mazroui, Azzeddine
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 300 - 311
  • [40] SPEECH ACTS - A SPEECH PHILOSOPHICAL ESSAY - GERMAN - SEARLE,JR
    FAUCHER, E
    ETUDES GERMANIQUES, 1975, 30 (01): : 87 - 88