A Benchmark Dataset to Distinguish Human-Written and Machine-Generated Scientific Papers

被引:8
|
作者
Abdalla, Mohamed Hesham Ibrahim [1 ]
Malberg, Simon [1 ]
Dementieva, Daryna [1 ]
Mosca, Edoardo [1 ]
Groh, Georg [1 ]
机构
[1] Tech Univ Munich, Sch Computat Informat & Technol, D-80333 Munich, Germany
关键词
text generation; large language models; machine-generated text detection;
D O I
10.3390/info14100522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As generative NLP can now produce content nearly indistinguishable from human writing, it is becoming difficult to identify genuine research contributions in academic writing and scientific publications. Moreover, information in machine-generated text can be factually wrong or even entirely fabricated. In this work, we introduce a novel benchmark dataset containing human-written and machine-generated scientific papers from SCIgen, GPT-2, GPT-3, ChatGPT, and Galactica, as well as papers co-created by humans and ChatGPT. We also experiment with several types of classifiers-linguistic-based and transformer-based-for detecting the authorship of scientific text. A strong focus is put on generalization capabilities and explainability to highlight the strengths and weaknesses of these detectors. Our work makes an important step towards creating more robust methods for distinguishing between human-written and machine-generated scientific papers, ultimately ensuring the integrity of scientific literature.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] A Study on Distinguishing ChatGPT-Generated and Human-Written Orthopaedic Abstracts by Reviewers: Decoding the Discrepancies
    Makiev, Konstantinos G.
    Asimakidou, Maria
    Vasios, Ioannis S.
    Keskinis, Anthimos
    Petkidis, Georgios
    Tilkeridis, Konstantinos
    Ververidis, Athanasios
    Iliopoulos, Efthymios
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (11)
  • [32] Corrupted by Algorithms? How AI-generated and Human-written Advice Shape (Dis)honesty
    Leib, Margarita
    Koebis, Nils
    Rilke, Rainer Michael
    Hagens, Marloes
    Irlenbusch, Bernd
    ECONOMIC JOURNAL, 2024, 134 (658): : 766 - 784
  • [33] AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably
    Porter, Brian
    Machery, Edouard
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [34] Applying the Turing Test to contouring: Are Machine-Generated Contours Indistinguishable From Human Generated Ones?
    Liu, A.
    Jiang, S.
    Sampath, S.
    Amini, A.
    Wong, J. Y. C.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2019, 105 (01): : E136 - E136
  • [35] A comparative study of thematic choices and thematic progression patterns in human-written and AI-generated texts
    Yang, Shu
    Chen, Shukun
    Zhu, Hailin
    Lin, Jiayi
    Wang, Xi
    SYSTEM, 2024, 126
  • [36] M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
    Wang, Yuxia
    Mansurov, Jonibek
    Ivanov, Petar
    Su, Jinyan
    Shelmanov, Artem
    Tsvigun, Akim
    Afzal, Osama Mohammed
    Mahmoud, Tarek
    Puccetti, Giovanni
    Arnold, Thomas
    Aji, Alham Fikri
    Habash, Nizar
    Gurevych, Iryna
    Nakov, Preslav
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 3964 - 3992
  • [37] A Deep Fusion Model for Human vs. Machine-Generated Essay Classification
    Corizzo, Roberto
    Leal-Arenas, Sebastian
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [38] Machine-Generated Hierarchical Structure of Human Activities to Reveal How Machines Think
    Altin, Mahsun
    Gursoy, Furkan
    Xu, Lina
    IEEE ACCESS, 2021, 9 (09): : 18307 - 18317
  • [39] What's the difference between human-written manuscripts versus ChatGPT-generated manuscripts involving "human touch"?
    Matsubara, Shigeki
    Matsubara, Daisuke
    JOURNAL OF OBSTETRICS AND GYNAECOLOGY RESEARCH, 2025, 51 (02)
  • [40] Human-Written vs AI-Generated Texts in Orthopedic Academic Literature: Comparative Qualitative Analysis
    Hakam, Hassan Tarek
    Prill, Robert
    Korte, Lisa
    Lovrekovi, Bruno
    Ostoji, Marko
    Ramadanov, Nikolai
    Muehlensiepen, Felix
    JMIR FORMATIVE RESEARCH, 2024, 8