ChatGPT as Research Scientist: Probing GPT's capabilities as a Research Librarian, Research Ethicist, Data Generator, and Data Predictor

被引:3
|
作者
Lehr, Steven A. [1 ]
Caliskan, Aylin [2 ]
Liyanage, Suneragiri [3 ]
Banaji, Mahzarin R. [3 ]
机构
[1] Cangrade Inc, Watertown, MA 02472 USA
[2] Univ Washington, Informat Sch, Seattle, WA 98195 USA
[3] Harvard Univ, Dept Psychol, Cambridge, MA 02138 USA
关键词
generative AI; large language models; scientific methods; cognitive science;
D O I
10.1073/pnas.2404328121
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
How good a research scientist is ChatGPT? We systematically probed the capabilities of GPT- 3.5 and GPT- 4 across four central components of the scientific process: as using psychological science as a testing field. In Study 1 (Research Librarian), unlike human researchers, GPT- 3.5 and GPT- 4 hallucinated, authoritatively generating fictional references 36.0% and 5.4% of the time, respectively, although GPT- 4 exhibited an evolving capacity to acknowledge its fictions. In Study 2 (Research Ethicist), GPT- 4 (though not GPT- 3.5) proved capable of detecting violations like p- hacking in fictional research protocols, correcting 88.6% of blatantly presented issues, and 72.6% of subtly presented issues. In Study 3 (Data Generator), both models consistently replicated patterns of cultural bias previously discovered in large language corpora, indicating that ChatGPT can simulate known results, an antecedent to usefulness for both data generation and skills like hypothesis generation. Contrastingly, in Study 4 (Novel Data Predictor), neither model was successful at predicting new results absent in their training data, and neither appeared to leverage substantially new information when predicting more vs. less novel outcomes. Together, these results suggest that GPT is a flawed but rapidly improving librarian, a decent research ethicist already, capable of data generation in simple domains with known characteristics but poor at predicting novel patterns of empirical data to aid future experimentation.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] RadOnc-GPT (gpt-4o) versus human data extraction for prostate cancer clinical research.
    Namazi, Mohammad Javad
    Osorio, Mariana Borras
    Holmes, Jason M.
    Routman, David M.
    Ebner, Daniel
    Wang, Peilong
    Liu, Wei
    Waddle, Mark Raymond
    JOURNAL OF CLINICAL ONCOLOGY, 2025, 43 (5_SUPPL)
  • [42] Librarian Roles in Institutional Repository Data Set Collecting: Outcomes of a Research Library Task Force
    Newton, Mark P.
    Miller, C. C.
    Bracke, Marianne Stowell
    COLLECTION MANAGEMENT, 2010, 36 (01) : 53 - 67
  • [43] Data Platform for the Research and Prevention of Alzheimer's Disease
    An, Ning
    Jin, Liuqi
    Yang, Jiaoyun
    Yin, Yue
    Jiang, Siyuan
    Jing, Bo
    Au, Rhoda
    HEALTHCARE AND BIG DATA MANAGEMENT, 2017, 1028 : 55 - 78
  • [44] An International Extension of Sweeney's Data Privacy Research
    Patterson, Wayne
    Winston-Proctor, Cynthia E.
    ADVANCES IN HUMAN FACTORS IN CYBERSECURITY, 2020, 960 : 28 - 37
  • [45] Data Collection in Dental Research: A Practitioner's Guide
    Peter, Tabitha
    Pendleton, Chandler
    Xie, Xian Jin
    INTERNATIONAL JOURNAL OF ORAL & MAXILLOFACIAL IMPLANTS, 2024, 39 (03) : 342 - 349
  • [46] Copyright's impact on data mining in academic research
    Handke, Christian
    Guibault, Lucie
    Vallbe, Joan-Josep
    MANAGERIAL AND DECISION ECONOMICS, 2021, 42 (08) : 1999 - 2016
  • [47] The research of RFID Middleware's Data Management Model
    Wang Yanyan
    Zhao Xiaofeng
    Wu Yaohua
    Xu Peipei
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2008, : 2565 - 2568
  • [48] Research on the data fusion of Syrup's brix sensor
    Sue, JJ
    Meng, YM
    Luo, Y
    Proceedings of the International Conference on Mechanical Engineering and Mechanics 2005, Vols 1 and 2, 2005, : 1297 - 1299
  • [49] Research & development on a new type of data acquisition board for mechanical vibration generator
    Liu, XY
    Shi, R
    PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 808 - 811
  • [50] Big data research is everyone's research-Making epilepsy data science accessible to the global community: Report of the ILAE big data commission
    Josephson, Colin B.
    Aronica, Eleonora
    Beniczky, Sandor
    Boyce, Danielle
    Cavalleri, Gianpiero
    Denaxas, Spiros
    French, Jacqueline
    Jehi, Lara
    Koh, Hyunyong
    Kwan, Patrick
    Mcdonald, Carrie
    Mitchell, James W.
    Rampp, Stefan
    Sadleir, Lynette
    Sisodiya, Sanjay M.
    Wang, Irene
    Wiebe, Samuel
    Yasuda, Clarissa
    Youngerman, Brett
    EPILEPTIC DISORDERS, 2024, 26 (06) : 733 - 752