Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster

被引:0
|
作者
Calabrese, Agostina [1 ,2 ]
Neves, Leonardo [1 ]
Shah, Neil [2 ]
Bos, Maarten W. [1 ]
Ross, Bjorn [2 ]
Lapata, Mirella [1 ]
Barbieri, Francesco [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Snap Inc, Santa Monica, CA USA
基金
欧洲研究理事会; 英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content moderators play a key role in keeping the conversation on social media healthy. While the high volume of content they need to judge represents a bottleneck to the moderation pipeline, no studies have explored how models could support them to make faster decisions. There is, by now, a vast body of research into detecting hate speech, sometimes explicitly motivated by a desire to help improve content moderation, but published research using real content moderators is scarce. In this work we investigate the effect of explanations on the speed of real-world moderators. Our experiments show that while generic explanations do not affect their speed and are often ignored, structured explanations lower moderators' decision making time by 7.4%.
引用
收藏
页码:398 / 408
页数:11
相关论文
共 50 条
  • [1] Hate Speech on Social Media
    Guiora, Amos
    Park, Elizabeth A.
    PHILOSOPHIA, 2017, 45 (03) : 957 - 971
  • [2] Hate Speech on Social Media
    Amos Guiora
    Elizabeth A. Park
    Philosophia, 2017, 45 : 957 - 971
  • [3] Identification of Hate Speech in Social Media
    Ruwandika, N. D. T.
    Weerasinghe, A. R.
    2018 18TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) CONFERENCE PROCEEDINGS, 2018, : 273 - 278
  • [4] Hate Speech Prediction on Social Media
    Ammar Aouchiche I.R.
    Boumahdi F.
    Madani A.
    Remmide M.A.
    SN Computer Science, 4 (3)
  • [5] Hate speech in social media activism
    Batista de Melo Filho, Jose Iran
    Cordeiro, Igor Lopes
    de Oliveira Arruda Gomes, Danielle Miranda
    AUSTRAL COMUNICACION, 2022, 11 (01):
  • [6] The Virality of Hate Speech on Social Media
    Maarouf A.
    Pröllochs N.
    Feuerriegel S.
    Proceedings of the ACM on Human-Computer Interaction, 2024, 8 (CSCW1)
  • [7] Tech to Combat Social Media Hate Speech
    Hampson, Michelle
    IEEE SPECTRUM, 2023, 60 (06) : 8 - 8
  • [8] A Measurement Study of Hate Speech in Social Media
    Mondal, Mainack
    Silva, Leandro Araujo
    Benevenuto, Fabricio
    PROCEEDINGS OF THE 28TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT'17), 2017, : 85 - 94
  • [9] Targets and Aspects in Social Media Hate Speech
    Shvets, Alexander
    Fortuna, Paula
    Soler-Company, Juan
    Wanner, Leo
    WOAH 2021: THE 5TH WORKSHOP ON ONLINE ABUSE AND HARMS, 2021, : 179 - 190
  • [10] Hate speech: Uncovering violence on social media
    Batista, Waleska Miguel
    Silva, Fabricio de Martino Costa e
    DIREITO E PRAXIS, 2024, 15 (03):