Student Mastery or AI Deception? Analyzing ChatGPT's Assessment Proficiency and Evaluating Detection Strategies

Citations: 0
Authors
Wang, Kevin [1]
Akins, Seth [1]
Mohammed, Abdallah [1]
Lawrence, Ramon [1]
Affiliations
[1] Univ British Columbia, Dept Comp Sci, Kelowna, BC V1V 2Z3, Canada
Keywords
ChatGPT; generative AI; performance; detection; plagiarism; CS1; CS2; database
DOI
10.1109/CSCI62032.2023.00268
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Generative AI systems such as ChatGPT have a disruptive effect on learning and assessment. Computer science requires practice to develop skills in problem solving and programming that are traditionally developed using assignments. Generative AI has the capability of completing these assignments for students with high accuracy, which dramatically increases the potential for academic integrity issues and students not achieving desired learning outcomes. This work investigates the performance of ChatGPT by evaluating it across three courses (CS1, CS2, databases). ChatGPT completes almost all introductory assessments perfectly. Existing detection methods, such as MOSS and JPlag (based on similarity metrics) and GPTZero (AI detection), have mixed success in identifying AI solutions. Evaluating instructors and teaching assistants using heuristics to distinguish between student and AI code shows that their detection is not sufficiently accurate. These observations emphasize the need for adapting assessments and improved detection methods.
Pages: 1615-1621
Number of pages: 7
Related papers
48 in total
  • [1] AI in obstetrics: Evaluating residents' capabilities and interaction strategies with ChatGPT
    Desseauve, David
    Lescar, Raphael
    de la Fourniere, Benoit
    Ceccaldi, Pierre-Francois
    Dziadzko, Mikhail
    EUROPEAN JOURNAL OF OBSTETRICS & GYNECOLOGY AND REPRODUCTIVE BIOLOGY, 2024, 302 : 238 - 241
  • [2] Analyzing student prompts and their effect on ChatGPT's performance
    Sawalha, Ghadeer
    Taj, Imran
    Shoufan, Abdulhadi
    COGENT EDUCATION, 2024, 11 (01)
  • [3] AI in obstetrics: Evaluating residents' capabilities and interaction strategies with ChatGPT: Correspondence
    Daungsupawong, Hinpetch
    Wiwanitkit, Viroj
    EUROPEAN JOURNAL OF OBSTETRICS & GYNECOLOGY AND REPRODUCTIVE BIOLOGY, 2024, 303 : 355 - 355
  • [4] AI in obstetrics: Evaluating residents' capabilities and interaction strategies with ChatGPT: Correspondence
    de la Fourniere, Benoit
    Ceccaldi, Pierre-Francois
    Dziadzko, Mikhail
    Desseauve, David
    EUROPEAN JOURNAL OF OBSTETRICS & GYNECOLOGY AND REPRODUCTIVE BIOLOGY, 2025, 305 : 419 - 420
  • [5] EVALUATING STUDENT LANGUAGE PROFICIENCY THROUGH PERFORMANCE ASSESSMENT TASKS
    Bagiryan, D.
    EDULEARN15: 7TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2015: 1448 - 1453
  • [6] AI in dermatology: Evaluating ChatGPT's ability to assist in the treatment of skin conditions
    Wu, Hamish
    Mitchell, Natasha
    Roh, Juhee
    Oakley, Amanda
    AUSTRALASIAN JOURNAL OF DERMATOLOGY, 2023, 64 : 8 - 9
  • [7] "Chatting with ChatGPT": Analyzing the factors influencing users' intention to Use the Open AI's ChatGPT using the UTAUT model
    Menon, Devadas
    Shilpa, K.
    HELIYON, 2023, 9 (11)
  • [8] Evaluating and analyzing student labor literacy in China's higher vocational education: an assessment model approach
    Wu, Suhan
    Duan, Jingyi
    Luo, Min
    FRONTIERS IN EDUCATION, 2024, 9
  • [9] A Potential Role for AI: Evaluating ChatGPT's Efficacy in Prioritizing Medical Waiting Lists
    Morcilla, Jericho
    Cao, Jessica Anning
    Fan, Kenneth
    Rahman, Effie
    Khang Ngo
    Patel, Sagar
    Chaudhary, Varun
    Wykoff, Charles Clifton
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [10] Evaluating ChatGPT’s Proficiency in Understanding and Answering Microservice Architecture Queries Using Source Code Insights
    Quevedo E.
    Abdelfattah A.S.
    Rodriguez A.
    Yero J.
    Cerny T.
    SN Computer Science, 5 (4)