Risk of bias assessment in preclinical literature using natural language processing

被引:11
|
作者
Wang, Qianying [1 ]
Liao, Jing [1 ]
Lapata, Mirella [2 ]
Macleod, Malcolm [1 ]
机构
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
automatic assessment; natural language processing; preclinical research synthesis; risk of bias;
D O I
10.1002/jrsm.1533
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 scores for each risk of bias item on the validation set and compare evaluation results on the test set to our previous regular expression approach. The F1 scores of best models on test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [31] Automated knowledge extraction from polymer literature using natural language processing
    Shetty, Pranav
    Ramprasad, Rampi
    ISCIENCE, 2021, 24 (01)
  • [32] Using Natural Language Processing for Context Identification in COVID-19 Literature
    Carvalho, Frederico
    Mariano, Diego
    Bomfim, Marcos
    Fiorini, Giovana
    Bastos, Luana
    Abreu, Ana Paula
    Paixao, Vivian
    Santos, Lucas
    Silva, Juliana
    Puelles, Angie
    Silva, Alessandra
    de Melo-Minardi, Raquel Cardoso
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2023, 2023, 13954 : 70 - 81
  • [33] GENDERED LANGUAGE IN NARRATIVE COMMENTS OF LEARNERS: NATURAL LANGUAGE PROCESSING AND GENDER BIAS
    Saker, Katerina
    Klein, Robin
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2022, 37 (SUPPL 2) : 211 - 211
  • [34] Uncovering Demographic Bias in Natural Language Processing Tools for Radiology
    Cai, Wenli
    RADIOLOGY, 2024, 313 (01)
  • [35] The Meaning and Measurement of Bias: Lessons from Natural Language Processing
    Jacobs, Abigail Z.
    Blodgett, Su Lin
    Barocas, Solon
    Daume, Hal, III
    Wallach, Hanna
    FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 706 - 706
  • [36] Nbias: A natural language processing framework for BIAS identification in text
    Raza, Shaina
    Garg, Muskan
    Reji, Deepak John
    Bashir, Syed Raza
    Ding, Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [37] Advancing Natural Language Processing in Educational Assessment
    Zhong, Jiabao
    Min, Qiaoyu
    ASSESSING WRITING, 2024, 60
  • [38] Natural Language Processing for Learning Assessment in STEM
    Caratozzolo, Patricia
    Rodriguez-Ruiz, Jorge
    Alvarez-Delgado, Alvaro
    PROCEEDINGS OF THE 2022 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2022), 2022, : 1549 - 1554
  • [39] Natural Language Processing for Construction Management: A Literature Review
    Hussain, Farheen
    Mehta, Siddhant
    Soy, Meta
    Zhang, Jiansong
    CONSTRUCTION RESEARCH CONGRESS 2024: ADVANCED TECHNOLOGIES, AUTOMATION, AND COMPUTER APPLICATIONS IN CONSTRUCTION, 2024, : 607 - 618
  • [40] Natural Language Processing for Materials Informatics of Literature Data
    Katsura, Yukari
    IEEJ Transactions on Fundamentals and Materials, 144 (09): : 350 - 359