Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports

Cited by: 0

Authors:

Bressem, Keno K. ^{[1,2,3,4,5]}

Adams, Lisa C. ^{[1,2,3,4,5]}

Gaudin, Robert A. ^{[6]}

Troeltzsch, Daniel ^{[6]}

Hamm, Bernd ^{[1]}

Makowski, Marcus R. ^{[7]}

Schuele, Chan-Yong ^{[1]}

Vahldiek, Janis L. ^{[1]}

Niehues, Stefan M. ^{[1]}

Affiliations:

[1] Charite, Dept Radiol, D-12203 Berlin, Germany

[2] Charite Univ Med Berlin, D-10117 Berlin, Germany

[3] Free Univ Berlin, D-10117 Berlin, Germany

[4] Humboldt Univ, D-10117 Berlin, Germany

[5] Berlin Inst Hlth, D-10117 Berlin, Germany

[6] Charite, Dept Oral & Maxillofacial Surg, D-12203 Berlin, Germany

[7] Tech Univ Munich, Sch Med, Dept Diagnost & Intervent Radiol, D-81675 Munich, Germany

Source:

BIOINFORMATICS | 2021 / Volume 36 / Issue 21

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Motivation: The development of deep, bidirectional transformers such as Bidirectional Encoder Representations from Transformers (BERT) has led to improved performance on several Natural Language Processing (NLP) benchmarks. In radiology in particular, large amounts of free-text data are generated in the daily clinical workflow. These report texts could be particularly useful for generating labels for machine learning, especially for image classification. However, because report texts are mostly unstructured, advanced NLP methods are needed to classify them accurately. While neural networks can be used for this purpose, they must first be trained on large amounts of manually labelled data to achieve good results. In contrast, BERT models can be pre-trained on unlabelled data and then require fine-tuning on only a small amount of manually labelled data to achieve even better results.

Results: Using BERT to identify the most important findings in intensive care chest radiograph reports, we achieve areas under the receiver operating characteristic curve of 0.98 for congestion, 0.97 for effusion, 0.97 for consolidation and 0.99 for pneumothorax, surpassing the accuracy of previous approaches with comparatively little annotation effort. Our approach could therefore help to improve information extraction from free-text medical reports.

Availability and implementation: We make the source code for fine-tuning the BERT models freely available at https://github.com/fast-raidiology/bert-for-radiology. Supplementary information is available at Bioinformatics online.


Pages: 5255-5261

Page count: 7
