The Limits of Abstract Evaluation Metrics: The Case of Hate Speech Detection

Cited by: 14

Authors:

Olteanu, Alexandra [1]

Talamadupula, Kartik [1]

Varshney, Kush R. [1]

Affiliations:

[1] IBM Res, Armonk, NY 10504 USA

Source:

PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17) | 2017

Keywords:

Evaluation metrics; hate speech; human-centered metrics

DOI:

10.1145/3091478.3098871

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Wagstaff (2012) draws attention to the pervasiveness of abstract evaluation metrics that explicitly ignore or remove problem specifics. While such metrics allow practitioners to compare numbers across application domains, they offer limited insight into the impact of algorithmic decisions on humans and into how humans perceive the algorithm's correctness. Even for problems that are mathematically the same, both the real cost of (mathematically) identical errors and their perceived cost to users may vary significantly with the specifics of each problem domain and with the user perceiving the result. While the real cost of errors has been considered previously, little attention has been paid to perceived cost. We advocate for the inclusion of human-centered metrics that elicit error costs from humans from two perspectives: the nature of the error and the user context. Focusing on hate speech detection on social media, we demonstrate that even when performance is held fixed as measured by an abstract metric such as precision, user perception of correctness varies greatly depending on the nature of the errors and on user characteristics.


Pages: 405-406

Page count: 2

