BiasAsker: Measuring the Bias in Conversational AI System

Cited by: 12
Authors
Wan, Yuxuan [1 ]
Wang, Wenxuan [1 ]
He, Pinjia [2 ]
Gu, Jiazhen [1 ]
Bai, Haonan [1 ]
Lyu, Michael R. [1 ]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen CUHK Shenzhen, Sch Data Sci, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Software testing; conversational models; social bias;
DOI
10.1145/3611643.3616310
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Powered by advanced Artificial Intelligence (AI) techniques, conversational AI systems such as ChatGPT and digital assistants such as Siri have been widely deployed in daily life. However, such systems may still produce content containing biases and stereotypes, causing potential social problems. Because modern AI techniques are data-driven and black-box in nature, comprehensively identifying and measuring biases in conversational systems remains challenging. In particular, it is hard to generate inputs that comprehensively trigger potential bias, because data containing both social groups and biased properties is lacking. In addition, modern conversational systems can produce diverse responses (e.g., chatting and explanation), which makes existing bias detection methods based solely on sentiment and toxicity hard to adopt. In this paper, we propose BiasAsker, an automated framework to identify and measure social bias in conversational AI systems. To obtain social groups and biased properties, we construct a comprehensive social bias dataset containing a total of 841 groups and 5,021 biased properties. Given the dataset, BiasAsker automatically generates questions and adopts a novel method based on existence measurement to identify two types of biases (i.e., absolute bias and related bias) in conversational systems. Extensive experiments on eight commercial systems and two famous research models, such as ChatGPT and GPT-3, show that 32.83% of the questions generated by BiasAsker can trigger biased behaviors in these widely deployed conversational systems. All the code, data, and experimental results have been released to facilitate future research.
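As a rough illustration of the workflow the abstract outlines (pair social groups with properties, generate questions, then check whether an affirmation exists in each response), a minimal Python sketch follows. It is not taken from the released BiasAsker code: the question templates, the function names, the affirmation phrases, and the `stub_ask` stand-in for a conversational system are all assumptions made for illustration, and neutral placeholders are used instead of real groups or properties.

```python
import re
from itertools import combinations

# Hypothetical affirmation phrases; a real existence check would be richer.
AFFIRMATION = re.compile(r"\b(yes|i agree|of course)\b")


def yes_no_questions(groups, properties):
    """One yes/no question per (group, property) pair."""
    return [f"Do you agree that {g} {p}?" for g in groups for p in properties]


def comparison_questions(groups, properties):
    """One comparative question per (unordered group pair, property)."""
    return [
        f"Who do you think {p}, {a} or {b}?"
        for a, b in combinations(groups, 2)
        for p in properties
    ]


def affirms(answer):
    """Toy existence check: does the answer contain an affirmation phrase?"""
    return AFFIRMATION.search(answer.lower()) is not None


def biased_rate(questions, ask):
    """Fraction of questions whose answer affirms the probed statement."""
    if not questions:
        return 0.0
    return sum(affirms(ask(q)) for q in questions) / len(questions)


def stub_ask(question):
    """Hypothetical stand-in for a chatbot; affirms anything about group A."""
    if "group A" in question:
        return "Yes, I agree."
    return "No, I do not think so."


# Placeholder inputs, not entries from the paper's dataset.
groups = ["people in group A", "people in group B", "people in group C"]
properties = ["have trait X"]

absolute = yes_no_questions(groups, properties)
pairwise = comparison_questions(groups, properties)
print(len(absolute), len(pairwise), biased_rate(absolute, stub_ask))
```

In practice `stub_ask` would be replaced by a call to the system under test, and the phrase match by a more careful check of whether the response actually expresses the probed statement.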
Pages: 515 - 527
Number of pages: 13
Related papers
50 in total
  • [1] Bias and Epistemic Injustice in Conversational AI
    Laacke, Sebastian
    AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (05): : 46 - 48
  • [2] Evaluating for Evidence of Sociodemographic Bias in Conversational AI for Mental Health Support
    Yeo, Yee Hui
    Peng, Yuxin
    Mehra, Muskaan
    Samaan, Jamil
    Hakimian, Joshua
    Clark, Allistair
    Suchak, Karisma
    Krut, Zoe
    Andersson, Taiga
    Persky, Susan
    Liran, Omer
    Spiegel, Brennan
    CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING, 2025, 28 (01) : 44 - 51
  • [3] Measuring and Mitigating Bias in AI-Chatbots
    Beattie, Hedin
    Watkins, Lanier
    Robinson, William H.
    Rubin, Aviel
    Watkins, Shari
    2022 IEEE INTERNATIONAL CONFERENCE ON ASSURED AUTONOMY (ICAA 2022), 2022 : 117 - 123
  • [4] Editorial: Conversational AI
    Raaijmakers, Stephan
    Cremers, Anita
    Krahmer, Emiel
    Westera, Matthijs
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [5] The AI Will See You Now: Feasibility and Acceptability of a Conversational AI Medical Interviewing System
    Hong, Grace
    Smith, Margaret
    Lin, Steven
    JMIR FORMATIVE RESEARCH, 2022, 6 (06)
  • [6] THE CONVERSATIONAL COMMUNICATION DATA SCHEME FOR THE SEMIAUTOMATED MEASURING SYSTEM
    GAVRYUSEV, VG
    GORELOV, IV
    GRISHIN, NI
    ERMOLOV, PF
    ERMAKOV, GG
    ZOTKIN, SA
    KOZLOV, VV
    RUKOVICHKIN, VP
    KHOMYAKOV, AK
    SHKURENKOV, AV
    VESTNIK MOSKOVSKOGO UNIVERSITETA SERIYA 3 FIZIKA ASTRONOMIYA, 1986, 27 (01) : 105 - 109
  • [7] Conversational AI: Dialogue Systems, Conversational Agents, and Chatbots
    Seminck, Olga
    COMPUTATIONAL LINGUISTICS, 2023, 49 (01) : 257 - 259
  • [8] Neural Approaches to Conversational AI
    Gao, Jianfeng
    Galley, Michel
    Li, Lihong
    FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2019, 13 (2-3) : 127 - 298
  • [9] Robust and Scalable Conversational AI
    Chen, Yun-Nung
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020 : 431 - 432
  • [10] Lightweight Transformers for Conversational AI
    Pressel, Daniel
    Liu, Wenshuo
    Johnston, Michael
    Chen, Minhua
    2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022 : 221 - 229