The Limits of Abstract Evaluation Metrics: The Case of Hate Speech Detection

Cited by: 14
Authors
Olteanu, Alexandra [1]
Talamadupula, Kartik [1]
Varshney, Kush R. [1]
Affiliation
[1] IBM Res, Armonk, NY 10504 USA
Keywords
Evaluation metrics; hate speech; human-centered metrics
DOI
10.1145/3091478.3098871
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Wagstaff (2012) draws attention to the pervasiveness of abstract evaluation metrics that explicitly ignore or remove problem specifics. While such metrics allow practitioners to compare numbers across application domains, they offer limited insight into the impact of algorithmic decisions on humans and into how humans perceive the algorithm's correctness. Even for problems that are mathematically the same, both the real cost of (mathematically) identical errors and their perceived cost to users may vary significantly with the specifics of each problem domain and with the user perceiving the result. While the real cost of errors has been considered previously, little attention has been paid to the perceived cost. We advocate for the inclusion of human-centered metrics that elicit error costs from humans from two perspectives: the nature of the error and the user context. Focusing on hate speech detection on social media, we demonstrate that even when performance as measured by an abstract metric such as precision is held fixed, user perception of correctness varies greatly depending on the nature of the errors and on user characteristics.
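The abstract's central point, that a fixed precision can hide very different perceived costs, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and none of it comes from the paper: the `FlaggedPost` type, the two false-positive categories, the two user profiles, and all cost values are hypothetical, and the counts are invented rather than measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FlaggedPost:
    """A post that a detector flagged as hate speech (hypothetical type)."""
    is_hate: bool      # ground-truth label
    error_kind: str    # category of the false positive; "" when correct


def precision(flagged):
    """Abstract metric: fraction of flagged posts that really are hate speech."""
    true_positives = sum(1 for post in flagged if post.is_hate)
    return true_positives / len(flagged)


def perceived_cost(flagged, cost_by_kind):
    """Human-centered metric: sum of elicited costs over the false positives."""
    return sum(cost_by_kind[post.error_kind] for post in flagged if not post.is_hate)


# Hypothetical elicited costs: one table per user profile (user context),
# one entry per false-positive category (nature of the error).
COSTS_USER_X = {"offensive_not_hate": 1.0, "benign": 5.0}
COSTS_USER_Y = {"offensive_not_hate": 4.0, "benign": 5.0}

# Two hypothetical detectors, each flagging 10 posts with 8 correct.
system_a = [FlaggedPost(True, "")] * 8 + [FlaggedPost(False, "offensive_not_hate")] * 2
system_b = [FlaggedPost(True, "")] * 8 + [FlaggedPost(False, "benign")] * 2

for name, system in (("A", system_a), ("B", system_b)):
    print(
        name,
        precision(system),
        perceived_cost(system, COSTS_USER_X),
        perceived_cost(system, COSTS_USER_Y),
    )
```

Both hypothetical systems have the same precision, yet their summed costs differ by error category and by user profile, which is the kind of gap an abstract metric alone cannot show.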
Pages: 405-406
Number of pages: 2