BiasAsker: Measuring the Bias in Conversational AI System

Cited by: 12

Authors:

Wan, Yuxuan [1]

Wang, Wenxuan [1]

He, Pinjia [2]

Gu, Jiazhen [1]

Bai, Haonan [1]

Lyu, Michael R. [1]

Affiliations:

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Chinese Univ Hong Kong, Shenzhen CUHK Shenzhen, Sch Data Sci, Shenzhen, Peoples R China

Source:

PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023 | 2023

Funding:

National Natural Science Foundation of China;

Keywords:

Software testing; conversational models; social bias;

DOI:

10.1145/3611643.3616310

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification codes:

081202 ; 0835 ;

Abstract:

Powered by advanced Artificial Intelligence (AI) techniques, conversational AI systems such as ChatGPT and digital assistants like Siri have been widely deployed in daily life. However, such systems may still produce content containing biases and stereotypes, causing potential social problems. Because modern AI techniques are data-driven and black-box in nature, comprehensively identifying and measuring biases in conversational systems remains challenging. In particular, it is hard to generate inputs that comprehensively trigger potential bias, because data containing both social groups and biased properties is lacking. In addition, modern conversational systems can produce diverse responses (e.g., chatting and explanation), which makes existing bias detection methods based solely on sentiment and toxicity hard to apply. In this paper, we propose BiasAsker, an automated framework to identify and measure social bias in conversational AI systems. To obtain social groups and biased properties, we construct a comprehensive social bias dataset containing a total of 841 groups and 5,021 biased properties. Given the dataset, BiasAsker automatically generates questions and adopts a novel method based on existence measurement to identify two types of biases (i.e., absolute bias and related bias) in conversational systems. Extensive experiments on eight commercial systems and two well-known research models, such as ChatGPT and GPT-3, show that 32.83% of the questions generated by BiasAsker can trigger biased behaviors in these widely deployed conversational systems. All the code, data, and experimental results have been released to facilitate future research.
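
The abstract describes generating questions by combining social groups with biased properties and then checking responses by existence measurement. The following is only a minimal sketch of that general idea, not the authors' implementation: the question templates, the function names, and the substring-based `mentions` check are all hypothetical and were chosen for illustration.

```python
from itertools import combinations, product

# Hypothetical templates; the paper's actual question templates are not given here.
SINGLE_TEMPLATE = "Do you agree that {group} {prop}?"
PAIR_TEMPLATE = "Who do you think {prop}, {a} or {b}?"


def single_group_questions(groups, properties):
    """Build one yes/no question per (group, property) combination."""
    return [SINGLE_TEMPLATE.format(group=g, prop=p)
            for g, p in product(groups, properties)]


def pairwise_questions(groups, properties):
    """Build one comparative question per (unordered group pair, property)."""
    return [PAIR_TEMPLATE.format(prop=p, a=a, b=b)
            for (a, b), p in product(combinations(groups, 2), properties)]


def mentions(response, phrase):
    """Crude stand-in for an existence check: is the phrase present in the response?"""
    return phrase.lower() in response.lower()
```

With g groups and p properties, this sketch yields g × p single-group questions and C(g, 2) × p comparative questions, which is why a dataset of this size can produce a large question pool.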

Citation:

Pages: 515-527

Page count: 13
