Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance

被引:0
|
作者
Wambsganss, Thiemo [1 ]
Su, Xiaotian [2 ]
Swamy, Vinitra [2 ]
Neshaei, Seyed Parsa [2 ]
Rietsche, Roman [1 ]
Kaser, Tanja [2 ]
机构
[1] Bern Univ Appl Sci, Bern, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs) are increasingly utilized in educational tasks such as providing writing suggestions to students. Despite their potential, LLMs are known to harbor inherent biases which may negatively impact learners. Previous studies have investigated bias in models and data representations separately, neglecting the potential impact of LLM bias on human writing. In this paper, we investigate how bias transfers through an AI writing support pipeline. We conduct a large-scale user study with 231 students writing business case peer reviews in German. Students are divided into five groups with different levels of writing support: one classroom group with featurebased suggestions and four groups recruited from Prolific - a control group with no assistance, two groups with suggestions from finetuned GPT-2 and GPT-3 models, and one group with suggestions from pre-trained GPT-3.5. Using GenBit gender bias analysis, Word Embedding Association Tests (WEAT), and Sentence Embedding Association Test (SEAT) we evaluate the gender bias at various stages of the pipeline: in model embeddings, in suggestions generated by the models, and in reviews written by students. Our results demonstrate that there is no significant difference in gender bias between the resulting peer reviews of groups with and without LLM suggestions. Our research is therefore optimistic about the use of AI writing support in the classroom, showcasing a context where bias in LLMs does not transfer to students' responses(1).
引用
收藏
页码:10275 / 10288
页数:14
相关论文
共 50 条
  • [21] She Elicits Requirements and He Tests: Software Engineering Gender Bias in Large Language Models
    Treude, Christoph
    Hata, Hideaki
    2023 IEEE/ACM 20TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2023, : 624 - 629
  • [22] From ChatGPT to Treatment: the Future of AI and Large Language Models in Surgical Oncology
    Ramamurthi, Adhitya
    Are, Chandrakanth
    Kothari, Anai N.
    INDIAN JOURNAL OF SURGICAL ONCOLOGY, 2023, 14 (03) : 537 - 539
  • [23] From ChatGPT to Treatment: the Future of AI and Large Language Models in Surgical Oncology
    Adhitya Ramamurthi
    Chandrakanth Are
    Anai N. Kothari
    Indian Journal of Surgical Oncology, 2023, 14 : 537 - 539
  • [24] Language and cultural bias in AI: comparing the performance of large language models developed in different countries on Traditional Chinese Medicine highlights the need for localized models
    Lingxuan Zhu
    Weiming Mou
    Yancheng Lai
    Junda Lin
    Peng Luo
    Journal of Translational Medicine, 22
  • [25] Language and cultural bias in AI: comparing the performance of large language models developed in different countries on Traditional Chinese Medicine highlights the need for localized models
    Zhu, Lingxuan
    Mou, Weiming
    Lai, Yancheng
    Lin, Junda
    Luo, Peng
    JOURNAL OF TRANSLATIONAL MEDICINE, 2024, 22 (01)
  • [26] (A Comparative Study on Development Models of AI Chinese Language Partners Based on the ERNIE Large Language Model)
    Lian, Weichen
    Zheng, Mingjian
    Xu, Juan
    JOURNAL OF TECHNOLOGY AND CHINESE LANGUAGE TEACHING, 2024, 15 (02): : 35 - 53
  • [27] AUTOMATING ECONOMIC MODELLING: A CASE STUDY OF AI'S POTENTIAL WITH LARGE LANGUAGE MODELS
    Reason, T.
    Rawlinson, W.
    Malcolm, B.
    Klijn, S.
    Langham, J.
    Gimblett, A.
    VALUE IN HEALTH, 2023, 26 (12) : S1 - S1
  • [28] ABScribe: Rapid Exploration & Organization of Multiple Writing Variations in Human-AI Co-Writing Tasks using Large Language Models
    Reza, Mohi
    Laundry, Nathan
    Musabirov, Ilya
    Dushniku, Peter
    Yu, Michael
    Mittal, Kashish
    Grossman, Tovi
    Liut, Michael
    Kuzminykh, Anastasia
    Williams, Joseph Jay
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [29] Comment on "From ChatGPT to Treatment: the Future of AI and Large Language Models in Surgical Oncology"
    Daungsupawong, Hinpetch
    Wiwanitkit, Viroj
    INDIAN JOURNAL OF SURGICAL ONCOLOGY, 2024, 15 (01) : 201 - 201
  • [30] Comment on “From ChatGPT to Treatment: the Future of AI and Large Language Models in Surgical Oncology”
    Hinpetch Daungsupawong
    Viroj Wiwanitkit
    Indian Journal of Surgical Oncology, 2024, 15 : 201 - 201