Advancing 3D medical image analysis with variable dimension transform based supervised 3D pre-training

被引：3

作者：

Zhang, Shu ^{[1
]}

Li, Zihao ^{[1
]}

Zhou, Hong-Yu ^{[2
]}

Ma, Jiechao ^{[1
]}

Yu, Yizhou ^{[1
,2
]}

机构：

[1] Deepwise Artificial Intelligence Lab, 8 Haidian Ave, Beijing, Peoples R China

[2] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China

来源：

NEUROCOMPUTING | 2023年 / 529卷

基金：

中国国家自然科学基金;

关键词：

3D medical image; Transfer learning; Variable dimension transform; Supervised pre-training; CT; NETWORK;

D O I：

10.1016/j.neucom.2023.01.012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The difficulties in both data acquisition and annotation substantially restrict the sample sizes of training datasets for 3D medical imaging applications. Therefore, it is non-trivial to build well-performing 3D con-volutional neural networks from scratch. Previous efforts on 3D pre-training have frequently relied on self-supervised approaches, which use either predictive or contrastive learning on unlabeled data to build invariant 3D representations. However, because of the unavailability of large-scale supervision informa-tion, obtaining semantically invariant and discriminative representations from these learning frame-works remains problematic. In this paper, we revisit an innovative yet simple fully-supervised 3D network pre-training framework to take advantage of semantic supervision from large-scale 2D natural image datasets. With a redesigned 3D network architecture, reformulated natural images are used to address the problem of data scarcity and develop powerful 3D representations. Comprehensive experi-ments on five benchmark datasets demonstrate that the proposed pre-trained models can effectively accelerate convergence while also improving accuracy for a variety of 3D medical imaging tasks such as classification, segmentation, and detection. In addition, as compared to training from scratch, it can save up to 60% of annotation efforts. On the NIH DeepLesion dataset, it also achieves state-of-the-art detection performance, outperforming earlier self-supervised and fully-supervised pre-training approaches, as well as methods that do training from scratch. To facilitate further development of 3D medical models, our code and pre-trained model weights are publicly available at https://github.com/u rmagicsmine/CSPR. (c) 2023 Elsevier B.V. All rights reserved.

引用

页码：11 / 22

页数：12

共 50 条

[41] 3D Shape Recovery by Aggregating 3D Wavelet Transform-Based Image Focus Volumes Through 3D Weighted Least Squares
Ali, Usman
Mahmood, Muhammad Tariq
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2020, 62 (01) : 54 - 72
[42] Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training
Guo, Ziyu
Zhang, Renrui
Qiu, Longtian
Li, Xianzhi
Heng, Pheng-Ann
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 791 - 799
[43] 3D Shape Recovery by Aggregating 3D Wavelet Transform-Based Image Focus Volumes Through 3D Weighted Least Squares
Usman Ali
Muhammad Tariq Mahmood
Journal of Mathematical Imaging and Vision, 2020, 62 : 54 - 72
[44] Coronary Arteries Segmentation Based on the 3D Discrete Wavelet Transform and 3D Neutrosophic Transform
Chen, Shuo-Tsung
Wang, Tzung-Dau
Lee, Wen-Jeng
Huang, Tsai-Wei
Hung, Pei-Kai
Wei, Cheng-Yu
Chen, Chung-Ming
Kung, Woon-Man
BIOMED RESEARCH INTERNATIONAL, 2015, 2015
[45] Lang3DSG: Language-based contrastive pre-training for 3D Scene Graph prediction
Koch, Sebastian
Hermosilla, Pedro
Vaskevicius, Narunas
Colosi, Mirco
Ropinski, Timo
2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1037 - 1047
[46] Conditional GAN with an Attention-Based Generator and a 3D Discriminator for 3D Medical Image Generation
Jung, Euijin
Luna, Miguel
Park, Sang Hyun
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2021, 12906 LNCS : 318 - 328
[47] Conditional GAN with an Attention-Based Generator and a 3D Discriminator for 3D Medical Image Generation
Jung, Euijin
Luna, Miguel
Park, Sang Hyun
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VI, 2021, 12906 : 318 - 328
[48] Masked Image Modeling Advances 3D Medical Image Analysis
Chen, Zekai
Agarwal, Devansh
Aggarwal, Kshitij
Safta, Wiem
Balan, Mariann Micsinai
Brown, Kevin
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1969 - 1979
[49] Single Image 3D Without a Single 3D Image
Fouhey, David F.
Hussain, Wajahat
Gupta, Abhinav
Hebert, Martial
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1053 - 1061
[50] Morphogenesis-based deformable models in 3D medical image analysis
Roux, C
Ibáñez, L
Hamitouche, C
Boniou, M
MEDICON 2001: PROCEEDINGS OF THE INTERNATIONAL FEDERATION FOR MEDICAL & BIOLOGICAL ENGINEERING, PTS 1 AND 2, 2001, : 452 - 455

← 1 2 3 4 5 →