GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling

被引：1

作者：

Qi L. ^{[1
]}

Jiang Z. ^{[1
,2
]}

Shi W. ^{[1
,2
]}

Qu F. ^{[1
]}

Feng G. ^{[1
]}

机构：

[1] Department of Computer Science and Technology, Changchun University of Science and Technology, Jilin, Changchun

[2] Zhongshan Institute of Changchun University of Science and Technology, Guangzhou, Zhongshan

来源：

Computers in Biology and Medicine | 2024年 / 176卷

关键词：

Brain tumor segmentation; Masked image modeling; Self-supervised learning;

D O I：

10.1016/j.compbiomed.2024.108547

中图分类号：

学科分类号：

摘要：

Self-supervised pre-training and fully supervised fine-tuning paradigms have received much attention to solve the data annotation problem in deep learning fields. Compared with traditional pre-training on large natural image datasets, medical self-supervised learning methods learn rich representations derived from unlabeled data itself thus avoiding the distribution shift between different image domains. However, nowadays state-of-the-art medical pre-training methods were specifically designed for downstream tasks making them less flexible and difficult to apply to new tasks. In this paper, we propose grid mask image modeling, a flexible and general self-supervised method to pre-train medical vision transformers for 3D medical image segmentation. Our goal is to guide networks to learn the correlations between organs and tissues by reconstructing original images based on partial observations. The relationships are consistent within the human body and invariant to disease type or imaging modality. To achieve this, we design a Siamese framework consisting of an online branch and a target branch. An adaptive and hierarchical masking strategy is employed in the online branch to (1) learn the boundaries or small contextual mutation regions within images; (2) to learn high-level semantic representations from deeper layers of the multiscale encoder. In addition, the target branch provides representations for contrastive learning to further reduce representation redundancy. We evaluate our method through segmentation performance on two public datasets. The experimental results demonstrate our method outperforms other self-supervised methods. Codes are available at https://github.com/mobiletomb/Gmim. © 2024 Elsevier Ltd

引用

共 50 条

[31] Object Adaptive Self-Supervised Dense Visual Pre-Training
Zhang, Yu
Zhang, Tao
Zhu, Hongyuan
Chen, Zihan
Mi, Siya
Peng, Xi
Geng, Xin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2228 - 2240
[32] Masked image modeling-based boundary reconstruction for 3D medical image segmentation
Liu, Chang
Cheng, Yuanzhi
Tamura, Shinichi
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
[33] PointVST: Self-Supervised Pre-Training for 3D Point Clouds via View-Specific Point-to-Image Translation
Zhang, Qijian
Hou, Junhui
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (10) : 6900 - 6912
[34] Curriculum Self-Supervised Learning for 3D CT Cardiac Image Segmentation
Taher, Mohammad Reza Hosseinzadeh
Ikuta, Masaki
Soni, Ravi
MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 145 - 156
[35] Representation Recovering for Self-Supervised Pre-training on Medical Images
Yan, Xiangyi
Naushad, Junayed
Sun, Shanlin
Han, Kun
Tang, Hao
Kong, Deying
Ma, Haoyu
You, Chenyu
Xie, Xiaohui
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2684 - 2694
[36] Self-supervised 3D medical image segmentation by flow-guided mask propagation learning
Bitarafan, Adeleh
Mozafari, Mohammad
Azampour, Mohammad Farid
Baghshah, Mahdieh Soleymani
Navab, Nassir
Farshad, Azade
MEDICAL IMAGE ANALYSIS, 2025, 101
[37] MimCo: Masked Image Modeling Pre-training with Contrastive Teacher
Zhou, Qiang
Yu, Chaohui
Luo, Hao
Wang, Zhibin
Li, Hao
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4487 - 4495
[38] Self-Supervised Medical Image Segmentation Using Deep Reinforced Adaptive Masking
Xu, Zhenghua
Liu, Yunxin
Xu, Gang
Lukasiewicz, Thomas
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 180 - 193
[39] FedATA: Adaptive attention aggregation for federated self-supervised medical image segmentation
Dai, Jian
Wu, Hao
Liu, Huan
Yu, Liheng
Hu, Xing
Liu, Xiao
Geng, Daoying
NEUROCOMPUTING, 2025, 613
[40] Masked Image Modeling Advances 3D Medical Image Analysis
Chen, Zekai
Agarwal, Devansh
Aggarwal, Kshitij
Safta, Wiem
Balan, Mariann Micsinai
Brown, Kevin
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1969 - 1979

← 1 2 3 4 5 →