ChatGPT as Research Scientist: Probing GPT's capabilities as a Research Librarian, Research Ethicist, Data Generator, and Data Predictor

被引:3
|
作者
Lehr, Steven A. [1 ]
Caliskan, Aylin [2 ]
Liyanage, Suneragiri [3 ]
Banaji, Mahzarin R. [3 ]
机构
[1] Cangrade Inc, Watertown, MA 02472 USA
[2] Univ Washington, Informat Sch, Seattle, WA 98195 USA
[3] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
关键词
generative AI; large language models; scientific methods; cognitive science;
D O I
10.1073/pnas.2404328121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
How good a research scientist is ChatGPT? We systematically probed the capabilities of GPT- 3.5 and GPT- 4 across four central components of the scientific process: as using psychological science as a testing field. In Study 1 (Research Librarian), unlike human researchers, GPT- 3.5 and GPT- 4 hallucinated, authoritatively generating fictional references 36.0% and 5.4% of the time, respectively, although GPT- 4 exhibited an evolving capacity to acknowledge its fictions. In Study 2 (Research Ethicist), GPT- 4 (though not GPT- 3.5) proved capable of detecting violations like p- hacking in fictional research protocols, correcting 88.6% of blatantly presented issues, and 72.6% of subtly presented issues. In Study 3 (Data Generator), both models consistently replicated patterns of cultural bias previously discovered in large language corpora, indicating that ChatGPT can simulate known results, an antecedent to usefulness for both data generation and skills like hypothesis generation. Contrastingly, in Study 4 (Novel Data Predictor), neither model was successful at predicting new results absent in their training data, and neither appeared to leverage substantially new information when predicting more vs. less novel outcomes. Together, these results suggest that GPT is a flawed but rapidly improving librarian, a decent research ethicist already, capable of data generation in simple domains with known characteristics but poor at predicting novel patterns of empirical data to aid future experimentation.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] The Research About Transit Planning Passenger OD Data Generator
    Ming, Yan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MANAGEMENT AND ENGINEERING (CME 2014), 2014, : 366 - 374
  • [22] A Research Computing and Data Capabilities Model for Strategic Decision-Making
    Schmitz, Patrick
    Mizumoto, Claire
    Hicks, John
    Brunson, Dana
    Krovitz, Gail
    Bottum, James R.
    Cutcher-Gershenfeld, Joel
    Wetzel, Karen
    Cheatham, Thomas, III
    PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2020, PEARC 2020, 2020, : 77 - 84
  • [23] Big data analytics capabilities: a systematic literature review and research agenda
    Mikalef, Patrick
    Pappas, Ilias O.
    Krogstie, John
    Giannakos, Michail
    INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2018, 16 (03) : 547 - 578
  • [24] Big data analytics capabilities: a systematic literature review and research agenda
    Patrick Mikalef
    Ilias O. Pappas
    John Krogstie
    Michail Giannakos
    Information Systems and e-Business Management, 2018, 16 : 547 - 578
  • [25] Generative Pre-Trained Transformer (GPT) in Research: A Systematic Review on Data Augmentation
    Sufi, Fahim
    INFORMATION, 2024, 15 (02)
  • [26] ChatGPT as a data analyst: an exploratory study on AI-supported quantitative data analysis in empirical research
    Prandner, Dimitri
    Wetzelhuetter, Daniela
    Hese, Soenke
    FRONTIERS IN EDUCATION, 2025, 9
  • [27] Seafarer citizen scientist ocean transparency data as a resource for phytoplankton and climate research
    Lavender, Samantha
    Beaugrand, Gregory
    Outram, Nicholas
    Barlow, Nigel
    Crotty, David
    Evans, Jake
    Kirby, Richard
    PLOS ONE, 2017, 12 (12):
  • [28] NIH CUTS RESEARCH FUNDING OF SCIENTIST UNDER INVESTIGATION FOR CELL PAPER DATA
    MERVIS, J
    SCIENTIST, 1990, 4 (12): : 9 - 9
  • [29] Research of ZigBee's data security and protection
    Li Chunqing
    Zhang Jiancheng
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 298 - 302
  • [30] Australia's LGBTIQ Research Data Landscape
    Saxby, Karinna
    AUSTRALIAN ECONOMIC REVIEW, 2022, 55 (02) : 290 - 308