BiasAsker: Measuring the Bias in Conversational AI System

被引:12
|
作者
Wan, Yuxuan [1 ]
Wang, Wenxuan [1 ]
He, Pinjia [2 ]
Gu, Jiazhen [1 ]
Bai, Haonan [1 ]
Lyu, Michael R. [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen CUHK Shenzhen, Sch Data Sci, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Software testing; conversational models; social bias;
D O I
10.1145/3611643.3616310
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Powered by advanced Artificial Intelligence (AI) techniques, conversational AI systems, such as ChatGPT, and digital assistants like Siri, have been widely deployed in daily life. However, such systems may still produce content containing biases and stereotypes, causing potential social problems. Due to modern AI techniques' data-driven, black-box nature, comprehensively identifying and measuring biases in conversational systems remains challenging. Particularly, it is hard to generate inputs that can comprehensively trigger potential bias due to the lack of data containing both social groups and biased properties. In addition, modern conversational systems can produce diverse responses (e.g., chatting and explanation), which makes existing bias detection methods based solely on sentiment and toxicity hardly being adopted. In this paper, we propose BiasAsker, an automated framework to identify and measure social bias in conversational AI systems. To obtain social groups and biased properties, we construct a comprehensive social bias dataset containing a total of 841 groups and 5,021 biased properties. Given the dataset, BiasAsker automatically generates questions and adopts a novel method based on existence measurement to identify two types of biases (i.e., absolute bias and related bias) in conversational systems. Extensive experiments on eight commercial systems and two famous research models, such as ChatGPT and GPT-3, show that 32.83% of the questions generated by BiasAsker can trigger biased behaviors in these widely deployed conversational systems. All the code, data, and experimental results have been released to facilitate future research.
引用
收藏
页码:515 / 527
页数:13
相关论文
共 50 条
  • [41] Engineering Bias in AI
    Weber C.
    IEEE Pulse, 2019, 10 (01): : 15 - 17
  • [42] Multimodal Conversational AI A Survey of Datasets and Approaches
    Sundar, Anirudh S.
    Heck, Larry
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 131 - 147
  • [43] ChatClimate: Grounding conversational AI in climate science
    Vaghefi, Saeid Ashraf
    Stammbach, Dominik
    Muccione, Veruska
    Bingler, Julia
    Ni, Jingwei
    Kraus, Mathias
    Allen, Simon
    Colesanti-Senni, Chiara
    Wekhof, Tobias
    Schimanski, Tobias
    Gostlow, Glen
    Yu, Tingyu
    Wang, Qian
    Webersinke, Nicolas
    Huggel, Christian
    Leippold, Markus
    COMMUNICATIONS EARTH & ENVIRONMENT, 2023, 4 (01):
  • [44] Semantic and pragmatic precision in conversational AI systems
    Bunt, Harry
    Petukhova, Volha
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [45] Disrupted self, therapy, and the limits of conversational AI
    Babushkina, Dina
    de Boer, Bas
    PHILOSOPHICAL PSYCHOLOGY, 2024,
  • [46] Conversational AI-empowered biophysical analysis
    Kumar, Vishesh
    Bryan, Shep
    Presse, Steve
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 551A - 551A
  • [47] Aesop: A Visual Storytelling Platform for Conversational AI
    Meo, Tim
    Raghavan, Aswin
    Salter, David A.
    Tozzo, Alex
    Tamrakar, Amir
    Amer, Mohamed R.
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5844 - 5846
  • [48] ChatClimate: Grounding conversational AI in climate science
    Saeid Ashraf Vaghefi
    Dominik Stammbach
    Veruska Muccione
    Julia Bingler
    Jingwei Ni
    Mathias Kraus
    Simon Allen
    Chiara Colesanti-Senni
    Tobias Wekhof
    Tobias Schimanski
    Glen Gostlow
    Tingyu Yu
    Qian Wang
    Nicolas Webersinke
    Christian Huggel
    Markus Leippold
    Communications Earth & Environment, 4
  • [49] Integrating Conversational AI and Machine Learning in Education
    Katake, Kanchan Jadhav
    Sugandhi, Rekha
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 4, SMARTCOM 2024, 2024, 948 : 327 - 338
  • [50] Could a Conversational AI Identify Offensive Language?
    da Silva, Daniela America
    Borges Louro, Henrique Duarte
    Goncalves, Gildarcio Sousa
    Marques, Johnny Cardoso
    Vieira Dias, Luiz Alberto
    da Cunha, Adilson Marques
    Tasinaffo, Paulo Marcelo
    INFORMATION, 2021, 12 (10)